PHAGOCYTIC ASSAY METHOD

The present invention relates to the field of
programmed cell death or apoptosis and in particular
to the phenomenon whereby apoptotic cells are rapidly
phagocytosed or engulfed by other cells.
Specifically, the invention provides assays, and
materials for use therein, which measure phagocytosis
of apoptotic cells. Such assays can be used to
identify chemical substances which influence the
phagocytic uptake of apoptotic cells and have
potential pharmacological activity. The assays of the
invention are well adapted for medium- and
high-throughput screening using a multi-well plate format.



During development and maintenance of living tissues a
large number of cells undergo programmed cell death, or
apoptosis, which is observed in both vertebrates and
invertebrates. For example, it has been shown that in
the nematode C. elegans 131 cells undergo programmed
cell death (Lui and Hengartner (1997) early 1997
International Worm Meeting, Abstract 371). Lysis of
the apoptotic cells is potentially harmful since their
contents may cause toxic damage to the surrounding
tissues. It has been observed that this harmful
effect is avoided because apoptotic cells are engulfed
and degraded by other cells. In mammals the
engulfing cells may be professional or
semi-professional phagocytes such as neutrophils or
macrophages or they may be neighbours of the dying
cells.



A key feature of the process of programmed cell death,
or apoptosis, is the efficiency with which the dying
cells are recognized and engulfed by phagocytes
(Savill, J. et al., Immunol Today, 14:131-136, 1993).
Apoptosis triggers a distinct sequence of events
characterized by the expression of phosphatidylserine
on the cell surface, DNA fragmentation or laddering
and the release of membrane-bound cell fragments
called apoptotic blebs and bodies (Cohen, J. J. et al.,
Annu Rev Immunol, 10:267-293, 1992; Kerr, J. F. R. et al.,
Br J Cancer, 26:239, 1972). Apoptotic cells and
bodies are phagocytosed via various receptors that
recognize phosphatidylserine and other undefined
ligands unique to the surface of apoptotic material
(Savill, J. S. et al., J Clin Invest, 83:865-875,
1989; Fadok, V. A. et al., J Immunol, 148:2207-2216,
1992; Savill, J. et al., Nature, 343:170-173, 1990).
In this way, apoptotic cells, which contain
potentially inflammatory factors, are rapidly cleared
by neighboring cells acting as semi-professional
phagocytes or voracious experts of the macrophage line
without inducing an inflammatory response (Fadok, V.
A. et al., J Clin Invest, 101:890-898, 1998).

The process of apoptosis has been associated with a
number of human diseases, including cancer, autoimmune
diseases, various neurodegenerative diseases, such as
Amyotrophic Lateral Sclerosis, Huntington's disease
and Alzheimer's disease, stroke, myocardial infarction
and AIDS (Thompson, CB, Science 267, pp 1456-1462).
Thus, much attention has been focused on elucidating
the mechanism of apoptosis and the genes controlling
it with a view to developing new therapeutic
strategies for these diseases.

Particular diseases have been associated with an
impairment of phagocytosis of apoptotic bodies.
Examples of such diseases include autoimmune diseases
such as systemic lupus erythematosus (Herrmann, M. et
al., Arthritis Rheum, 41:1241-1250, 1998), AIDS
(Zocchi, M. R. et al., AIDS, 11:1227-1235, 1997),
acute pulmonary infections (Cox, G. et al., Am. J.
Respir. Cell Mol. Biol., 12:232-237, 1995) and allergy
(Ying, S. et al., Proc Assoc Am Physicians, 109:42-50,
1997). It is clear that modulation of phagocytosis of
apoptotic cells by drugs is a promising strategy for
future therapies.

Phagocytosis of apoptotic cells in vertebrates has
been observed to be a very complicated process and how
any signal generated by the dying cell is received and
transduced by the engulfing cell is not understood.

A swift engulfment of apoptotic cells is observed in
the hermaphrodite C. elegans and this worm has
provided a useful tool for study of the engulfment
process. For example, six genes have been identified
in C. elegans as effecting engulfment, known as ced-1,
ced-2, ced-5, ced-6, ced-7 and ced-10. Of these ced-6
has been singled out by the present inventors for
particular study. It is known that ced-6 maps to
chromosome III near daf-4 in C. elegans (Lui and
Hengartner (1997); Lui and Hengartner (1996) East
Coast Worm Meeting Abstract 128). That work showed
that two cosmids from this region, F56D2 and F43F12,
could rescue C. elegans with a ced-6 (n1813)
engulfment defect. A 10 kb Xho I subclone from F56D2
with rescuing activity was identified as carrying the
ced-6 gene.

The present inventors have identified two human
homologues of the C. elegans ced-6 (h1ced-6 and
h2ced-6), h2ced-6 being a splice variant of h1ced-6
and thought to be a dominant negative version thereof.
Both homologues have been shown to be present in the
human cell-line THP-1. A surprising degree of
sequence homology between hCED-6 and C. elegans CED-6
has been found. hCED-6 has a phosphotyrosine binding
domain from about amino acid position 11 to about
amino acid position 190 as shown in Figure 4
suggesting its involvement in a tyrosine kinase signal
transduction pathway.

h1CED-6 and h2CED-6 proteins and their encoding
nucleic acids are useful for carrying out assays as
described herein to identify compounds which are
inhibitors or enhancers of a signal transduction
pathway which promotes phagocytosis of apoptotic
cells. In particular they are useful for identifying
inhibitors or enhancers of h1CED-6 and h2CED-6 or
inhibitors or enhancers of the transcription thereof.
Such inhibitors or enhancers may be useful therapeutic
agents in the treatment of some of the aforementioned
diseases.

In accordance with its first aspect the invention
provides an expression vector capable of expressing
h1CED-6 or h2CED-6 which vector comprises a sequence
of deoxynucleotides encoding the amino acid sequence
of Figure 4 or Figure 5 or an amino acid sequence
which differs from the amino acid sequence of Figure 4
or Figure 5 only in amino acid changes which are
conservative of biological function.

The term "biological function" is defined herein to
mean the ability to regulate or affect phagocytosis of
apoptotic cells. Amino acid changes which are
"conservative" are those which permit biological
function to be retained although it may be less than
or greater than the level of biological function of
the wild-type human CED-6 protein. Such conservative
changes may include insertion or deletion of one or
more amino acids or substitution of one or more amino
acids with another amino acid or acids having similar
chemical characteristics. The choice of amino acids
for making conservative changes will be well-known to
those skilled in the art.

In a preferred embodiment the expression vector is one
comprising the sequence of deoxynucleotides shown from
the transcription start codon to the transcription
stop codon shown in Figure 2 or Figure 3 and
optionally a vector comprising the sequence of
deoxynucleotides shown in Figure 2 or Figure 3.



In a particularly preferred embodiment the expression
vector of the invention comprises a sequence of
nucleotides encoding a reporter gene positioned so
that expression of h1CED-6 or h2CED-6 results in
expression of the reporter gene. The reporter gene
may be positioned 3' or 5' to said h1ced-6 or h2ced-6
and may be expressed as a fusion protein with h1CED-6
or h2CED-6. Suitable reporter genes are those which
express a fluorescent product such as green
fluorescent protein (GFP). Other suitable reporter
genes encode enzymes, such as β-galactosidase or
luciferase, which are capable of acting on a substrate
to produce a detectable product, for example a
fluorescent product or luminescent product. Examples
of expression vectors in accordance with the invention
are pGA3103 and pGA3104 which are shown in Figures 29
and 10 respectively.



In another preferred embodiment the expression vector
of the invention expresses an epitope tag at the amino
and/or carboxy terminal of the h1CED-6 or h2CED-6
protein. An example is the plasmid pBAD/HisA-h1ced-6
the DNA sequence of which is shown in Figure 17.

It will be understood that the expression vectors
described above will comprise not only nucleic acid
encoding h1CED-6 or h2CED-6 or functional variants
thereof but also regulatory sequences operably linked
to said nucleic acid, such as promoter regions that
are capable of effecting expression of the DNA
fragments. The term "operably linked" refers to a
juxtaposition wherein the components described are in
a relationship permitting them to function in their
intended manner.

Regulatory elements required for expression include
promoter sequences to bind RNA polymerase and to
direct an appropriate level of transcription
initiation and also translation initiation sequences
for ribosome binding. For example, a bacterial
expression vector may include a promoter, such as the
lac promoter, and for translation initiation the
Shine-Dalgarno sequence and the start codon AUG. Similarly,
a eukaryotic expression vector may include a
heterologous or homologous promoter for RNA polymerase
II, a downstream polyadenylation signal, the start
codon AUG, and a termination codon for detachment of
the ribosome. Such vectors may be obtained
commercially or be assembled from the sequences
described by methods well known in the art. Promoter
sequences which may be used in the expression
vectors of the invention include HSP, CMV, SV40,
EF-1a, UbC, SG, RSV, TRE/minCMV, HSV TK, 5' LTR and
QBISP136 enhanced CMV.

Examples of expression vectors described herein are
plasmids but may also be virus or phage vectors. Such
vectors will normally possess an origin of replication
and one or more selectable markers such as a gene for
antibiotic resistance. It is particularly preferred
that the expression vectors of the invention are
suitable for transfection of mammalian cells and
therefore may be provided with a selectable marker
accordingly.

In accordance with a second aspect the invention
provides a mammalian cell-line transfected with any of
the expression vectors described above. Methods of
transfecting mammalian cells are well-known to those
skilled in the art. The cell-line may be one which is
capable of growing in monolayer culture or in
suspension culture. Suitable cell-lines are
fibroblast cell-lines or epithelial cell-lines such as
COS1, BHK21, L929, PC12, CV1, SWISS3T3, HT144, IMR32,
HEPG2, MDCK, MCF7, 293, Hela, A549, SW48 or G361.
Primary cell-lines such as human dermal FIBs, dermal
keratinocytes, leucocytes, monocytes, lymphocytes,
dendritic cells or macrophages may also be used.
Particularly preferred for use in the phagocytosis
assays of the invention are mammalian professional or
semi-professional phagocytes of which examples are
mouse macrophage cell-line J774 or human monocyte
cell-line THP1 which has been shown to express h1CED-6
and h2CED-6 (see Example 3 and Figure 6). Both of the
above cell-lines may be referred to as monocyte
cell-lines since monocytes are capable of differentiating
into macrophages which is the form in which they are
used for the assay of the invention.

In accordance with a third aspect the invention
provides a method for determining whether a compound
is an enhancer or an inhibitor of a signal
transduction pathway which promotes phagocytosis of
apoptotic cells which method comprises exposing
mammalian cells transfected with h1ced-6 or h2ced-6 as
described above to apoptotic particles, and measuring
the rate of phagocytic uptake of said particles by
said transfected cells in the presence or absence of
said compound. The test compound is preferably added
prior to addition of said apoptotic particles.

Suitable apoptotic particles are cells such as
neutrophils, lymphocytes, erythrocytes or
dendritic cells which have been rendered apoptotic and
are optionally opsonized by exposure to serum. Cells
suitable for forming the apoptotic particles include
the cell-lines L929 and PC12. A particularly
preferred cell-line for use as an apoptotic particle
is the growth factor dependent mouse cell-line Ba/F3.
These may be grown in standard culture medium as
described in Example 5 and can be rendered apoptotic
by growth in the absence of the growth factor IL-3 for
a suitable period (for example about 20 hours) prior
to use. The apoptotic status of the cells can be
determined using, for example, an annexin/propidium
iodide labelling kit available from Boehringer
Mannheim (Brussels, Belgium). Cells are considered
early apoptotic if they are about 20% annexin positive
and less than about 5% propidium iodide positive.

The PC12 cell-line may be rendered apoptotic by
growth in standard medium in the absence of nerve
growth factor.

As an alternative to the cells described above the
apoptotic particles could be a non-living material
such as dye-labelled latex beads. 0.1 µm, 1 µm, 4 µm and
10 µm beads that have either an amino or carboxylate
group are available from Sigma-Aldrich, Bornem,
Belgium, or Molecular Probes, Eugene, USA.

In order for the assay described to be suitable for
high-throughput compound screening it is preferred
that the apoptotic particles bear some kind of
detectable label so that it will be readily apparent
that the particles have been taken up by the
transfected mammalian cell and so that this can be
quantified. The inventors have found that this may be
easily achieved by stably transfecting the cells
comprising the apoptotic particles with an expression
vector comprising a reporter gene. A particularly
suitable reporter gene encodes β-galactosidase
which is capable of cleaving the fluorogenic substrate
fluorescein di-β-D-galactopyranoside to a fluorescent
compound which may be monitored using standard
fluorescence detection equipment. Other fluorescent
substrates are available for β-galactosidase. A
plasmid pcDNA3.1/HIS/lacZ, which expresses
β-galactosidase and is suitable for transfecting cells
used as apoptotic particles, for example Ba/F3, is
shown in Figure 11. Other suitable reporter genes are
those encoding fluorescent proteins such as green
fluorescent protein or proteins capable of generating
a luminescent signal such as luciferase. Plasmids
pEGFP-N3 or pEGFP-C2, available from Clontech, are
suitable for transfecting cells used as apoptotic
particles and are shown in Figures 7, 8, 26
and 27. A plasmid "PGL Control" available from
Promega is suitable for transfecting cells used
as apoptotic particles such as Ba/F3 cells with a gene
encoding luciferase. The DNA sequence of "PGL
control" is shown in Figure 19.

It will be appreciated that the choice of reporter
gene for the apoptotic particles may be governed by
the presence of any reporter gene in the transfected
mammalian cells. For example, the presence of the
same reporter gene in the mammalian cells
transfected with h1 or h2 ced-6 and the apoptotic
cells is not prima facie desirable because of overlap
of signals, although this may not always be the case.

It will further be appreciated that a wide variety of
compounds can be tested to see whether they are
inhibitors or enhancers of signal transduction
pathways which promote phagocytosis of apoptotic
cells. The compound may be of any chemical formula, a
polymer or a monomer. For example the test compound
may be genomic DNA, cDNA, RNA, PNA, a protein or
polypeptide, an amino acid, nucleoside or nucleotide.
The compound may be one of known biological or
pharmacological activity, a known compound without
such activity or a novel molecule such as might be
present in a combinatorial library of compounds.

It will be appreciated that where the presence of a
compound results in no engulfment, or a decreased amount
of engulfment, of apoptotic particles by the
transfected mammalian cells, those cells must be
tested for viability. The presence of viable cells
will confirm that lack of engulfment is due to the
effect of the test compound on phagocytic activity and
not just non-specific toxicity.
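
By way of illustration only, the viability check described
above can be sketched as a simple triage rule. The record
fields, the function name and both thresholds (a 50% drop in
uptake and 80% viability) are hypothetical values chosen for
the example and are not taken from this specification.

```python
from dataclasses import dataclass


@dataclass
class WellResult:
    """One test compound; all field names are illustrative assumptions."""
    compound: str
    uptake_with: float      # uptake signal in the presence of the compound
    uptake_without: float   # uptake signal in the absence of the compound
    viable_fraction: float  # fraction of phagocytes still viable (0 to 1)


def triage(result: WellResult,
           uptake_drop: float = 0.5,
           min_viability: float = 0.8) -> str:
    """Separate a true loss of engulfment from non-specific toxicity.

    Both thresholds are assumed example values, not prescribed ones.
    """
    if result.uptake_without <= 0:
        raise ValueError("control uptake must be positive")
    ratio = result.uptake_with / result.uptake_without
    if ratio > uptake_drop:
        return "no marked decrease"
    # Engulfment is reduced: only viable cells support a real effect.
    if result.viable_fraction >= min_viability:
        return "candidate inhibitor"
    return "possible non-specific toxicity"
```

A compound is reported as a candidate inhibitor only when
uptake falls while the phagocytes remain viable, which mirrors
the reasoning of the preceding paragraph.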

Furthermore, any compound identified as an inhibitor
or an enhancer of phagocytosis of apoptotic cells by
the assay described above will be further tested to
establish whether the effect is mediated through
CED-6. In the case of a compound identified as an
enhancer of phagocytosis of apoptotic cells this can
be achieved by carrying out a phagocytosis assay
exactly as described above with mammalian cells which
are not transfected with h1ced-6 or h2ced-6. If the
compound is able to induce a phenotype in the
untransfected cells which is similar to that
of those cells when transfected with h1ced-6 then it
is an indication that the compound in question exerts
its effect via CED-6 or via the CED-6 signal
transduction pathway.

Similarly, if a compound identified in the above
described assay is an inhibitor of phagocytosis of
apoptotic cells it can be confirmed whether its effect



WO 99/64586 PCT/EP99/04043 

is via CED-6 or the CED-6 signal transduction pathway
by examining the phenotype of the transfected
mammalian cells exposed to the compound. Reversion to
a wild-type phenotype is an indication of action via
CED-6 or the CED-6 signal transduction pathway.

In a fourth of its aspects the invention provides a
method for determining whether a compound is an
inhibitor or an enhancer of a signal transduction
pathway which promotes phagocytosis of apoptotic cells
which method comprises the steps of:

(1) micro-injecting into a mammalian cell a
human CED-6 protein comprising the sequence
of amino acids shown in Figure 4 or Figure 5
or a sequence of amino acids differing from
that shown in Figure 4 or Figure 5 only in
amino acid changes conservative of function,
and

(2) exposing the mammalian cell produced in step
(1) to apoptotic particles and measuring the
rate of phagocytic uptake of said particles
by said cell in the presence and absence of
said compound.

Preferably, the mammalian cell is micro-injected with
a fusion protein comprising h1CED-6 or h2CED-6 and a
reporter gene, which may be any one of the reporter
genes described above. Preferred fusion proteins are
obtainable by expression from the GFP and h1ced-6
encoding sequences of the plasmids shown in Figures 9
or 28.

All of the preferred features and embodiments
described above for assays with transfected mammalian
cell-lines can be applied to cells micro-injected with
h1CED-6 or h2CED-6 or fusions thereof as described
above.

In a fifth aspect the invention provides a method for 
determining whether a compound is an inhibitor or an 
enhancer of a signal transduction pathway which 
promotes phagocytosis of apoptotic cells which method 
comprises the steps of: 

(1) micro-injecting or transfecting into a
mammalian cell a vector expressing RNA
antisense to all or a portion of the
sequence of nucleotides shown in Figure 2 or
Figure 3;

(2) exposing the mammalian cell produced in step
(1) to apoptotic particles and measuring the
rate of phagocytic uptake of said particles
by said cell in the presence or absence of
said compound.

Preferably, the antisense DNA comprises a sequence of
nucleotides which is capable of hybridizing to a
sequence of nucleotides as shown in Figure 2 or Figure
3 under conditions of stringency which are higher than
2xSSC; 0.1% SDS; 25°C to 50°C.

All of the preferred features and embodiments
described above for assays with transfected mammalian
cell lines can be applied to the cell-lines injected
with antisense RNA.
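
As a minimal sketch of what "antisense" means at the sequence
level, the following derives the reverse complement of a sense
DNA strand and the corresponding antisense RNA. The function
names and the 12-nucleotide input used in the usage note are
hypothetical stand-ins for illustration; the actual sequences
are those of Figure 2 or Figure 3, which are not reproduced
here.

```python
# Translation table for Watson-Crick base pairing (DNA alphabet only).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(sense_dna: str) -> str:
    """Return the RNA that would hybridize to the given sense strand."""
    return reverse_complement(sense_dna).upper().replace("T", "U")
```

For an assumed sense fragment "ATGAACCGTGCT" the antisense RNA
is "AGCACGGUUCAU". Whether a given antisense sequence
hybridizes under the stated stringency depends on its length
and composition and is not modelled by this sketch.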

It will be appreciated that the transfected mammalian
cells for use in the assays described above may be
transfected with h1ced-6 or h2ced-6. Since h2ced-6 is
thought to be a dominant negative version of h1ced-6
having an opposite biological effect, transfected
cells can be chosen depending on whether it is desired
to identify compounds which are inhibitors or
enhancers of apoptotic cell phagocytosis. For
example cells transfected with h1ced-6 would be
particularly suitable for identifying inhibitors of
phagocytosis of apoptotic cells while cells
transfected with h2ced-6 would be particularly
suitable for identifying enhancers.

It is hereby stated that the invention also relates to
any compound identified as an inhibitor or enhancer
of a signal transduction pathway which promotes
phagocytosis of apoptotic cells as identified in
accordance with any of the assay methods described
herein.

The nucleotide sequences for h1ced-6 and h2ced-6 are
shown in Figures 2 and 3 respectively. In addition
cDNAs encoding the alternative splice h2CED-6 and the
insert to reconstitute h1ced-6 from h2ced-6 have been
deposited at the Belgian Coordinated Collections of
Microorganisms (BCCM) at Laboratorium voor Moleculaire
Biologie - Plasmidencollectie (LMBP) B-9000, Gent,
Belgium in accordance with the Budapest Treaty on 8th
June 1998 and have been accorded the Accession Nos 3868
and 3869 respectively.






Sequences can be obtained in both deposits using T3 or
T7 primers (either one or both can be used, they are
at both sites of the actual insert). Both are
commercially available from Clontech (1227 and 1228)
and the sequence is shown below

T7 primer: 5' (TAATACGACTCACTATAGGGAGA) 3'

T3 primer: 5' (ATTAACCCTCACTAAAGGGA) 3'
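
For illustration, basic properties of the two primers quoted
above can be checked programmatically. The sketch below counts
length and G+C content and applies the Wallace rule of thumb,
Tm = 2(A+T) + 4(G+C) in degrees Celsius, which is only a rough
estimate for short oligonucleotides; the helper names are
hypothetical and the melting temperatures are estimates, not
values stated in this specification.

```python
T7_PRIMER = "TAATACGACTCACTATAGGGAGA"  # as quoted above
T3_PRIMER = "ATTAACCCTCACTAAAGGGA"     # as quoted above


def gc_count(seq: str) -> int:
    """Number of G and C bases in the sequence."""
    s = seq.upper()
    return s.count("G") + s.count("C")


def wallace_tm(seq: str) -> int:
    """Rough melting temperature estimate (Wallace rule), in deg C."""
    gc = gc_count(seq)
    at = len(seq) - gc  # assumes the sequence contains only A, C, G, T
    return 2 * at + 4 * gc


for name, primer in (("T7", T7_PRIMER), ("T3", T3_PRIMER)):
    print(name, len(primer), gc_count(primer), wallace_tm(primer))
```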
In addition to developing assays based on mammalian
cells which over or under express human CED-6 protein
the present inventors have identified epitopes of
h1CED-6 and have generated useful antibodies thereto.

Therefore in a sixth aspect the invention comprises a
fragment of human CED-6 protein having the amino acid
sequence as shown in Figure 4 wherein said fragment
includes the sequence of amino acids HRAFSRKKDKTC,
FLGSTEVEQPKGTE or TRNGTQPPPVPSRST. Antibody
preparations have been prepared which comprise
antibodies to one or more of the above epitopes. Such
antibody preparations are obtainable by the method
described in Example 6 and their specificity is
demonstrated by the Western Blots carried out in
Example 7 (see Figures 20 to 25).

The antibodies described above may be used in a method
of diagnosing a disease in a patient which is
associated with over or under expression of human
CED-6 in phagocytic cells. Specifically, there is
provided a method for diagnosing a disease associated
with the over or under expression of human CED-6
protein in phagocytic cells in an individual which
comprises:

(a) obtaining a sample of phagocytes from said
individual;

(b) exposing said phagocytes to an antibody
preparation as described above;

(c) quantitatively measuring the presence of any
immune complexes formed between said
antibodies and said CED-6 protein; and

(d) comparing the amount of immune complex
formed with that formed using phagocytes
from a control individual.



The antibodies described above may be further used in
assays for determining whether a compound is an
inhibitor or an enhancer of a signal transduction
pathway which promotes phagocytosis of apoptotic
cells. Specifically, there is provided a method for
determining whether a compound is an inhibitor or an
enhancer of a signal transduction pathway which
promotes phagocytosis of apoptotic cells which method
comprises:

(a) exposing a mammalian cell transfected with
an expression vector as described above to
the compound to be tested;

(b) exposing said mammalian cell to an antibody
preparation as described above;

(c) quantitatively measuring the presence of any
immune complex formed between said
antibodies and protein expressed by said
cells; and

(d) comparing the level of immune complex
detected with the amount of immune complex
detected in a cell transfected as
described in step (a) which has not been
exposed to said compound.

In the above described method the mammalian cell may
be selected from COS1, BHK21, L929, CV1, SWISS3T3,
HT144, IMR32, HEPG2, MDCK, MCF7, 293, Hela, A549, SW48
or G361 with COS1 cells being particularly preferred.

Alternatively, the mammalian cell is a human dermal
FIB, dermal keratinocyte, leucocyte, monocyte,
lymphocyte, dendritic cell or macrophage. Preferred
are professional phagocytes such as mouse macrophage
cell-line J774 or human monocyte cell-line THP-1.

Other uses for the antibodies of the invention
include purification of h1CED-6 and identification of
proteins interacting with CED-6 so that the signal
transduction pathway can be characterised, detecting
over or under expression, cellular localization or
post-translational modifications of hCED-6, epitope
mapping and identification of active sites, and
pharmaceutical compositions comprising said antibodies
in a suitable carrier or diluent.

In accordance with a seventh aspect the invention also 
provides a method for diagnosing a disease associated 
with the over- or under-expression of human CED-6 in 
phagocytic cells in an individual, which method 
comprises:

(a) obtaining a sample of phagocytes from said 
individual, 

(b) isolating RNA from said phagocytes, 

(c) preparing cDNA from said RNA, 

(d) performing a first PCR reaction on said 
cDNA, 

(e) performing a second (nested) PCR on the 
reaction product of said first PCR reaction, 

(f) quantitatively and qualitatively measuring
the presence of human ced-6 RNA by analysing
the reaction products from the first and
second PCR and

(g) comparing the amount and type of reaction
products formed in the first and second PCR
with that of the reaction products formed
using phagocytes from control individuals.

Preferably, the PCR is performed with primers derived
from the sequence of h1ced-6 or h2ced-6 as defined
herein or derived from the vector used in the
generation of cDNA. In particular said first PCR may
be performed with primers having nucleotide sequences:

1) cgcaaggatccccatgaaccgtgcttttagcaggaag
2) gatctactaggtactggag

The second PCR may be performed with primers having
nucleotide sequences:

1) cgcaaggatccccatgaaccgtgcttttagcaggaag
2) gcggatggtaccgtcgactgctgatacttgagttattctcag

The assay methods described herein, developed by the
present inventors for determining whether compounds
are enhancers or inhibitors of human CED-6 have been
found to be more generally applicable for identifying
compounds which influence phagocytosis of apoptotic
cells by a mechanism not necessarily related to
human CED-6 or the signal transduction pathway of
which it forms a component.

Liu et al. (Liu, Y. et al., The American Association of
Immunologists, 1:1999) describe an assay for
identifying compounds which influence phagocytosis of
apoptotic cells, in which varying concentrations of
the compound to be tested are added to the phagocytes
which are subsequently seeded with apoptotic cells. To
quantify phagocytosis of apoptotic cells, the authors
used a microscopic quantification of phagocytosis in
which the uptake of apoptotic cells was shown by
electron microscopy and counted by light microscopy
with a minimum of cells per slide being counted
(Savill, J. S., Wyllie, A. H., Henson, J. E., Walport, M.
J., Henson, P. M., Haslett, C., J Clin Invest, 83:865-875,
1989).

The presently known techniques for quantitating the
phagocytosis of apoptotic cells do not readily lend
themselves to high throughput screening of compounds
for potential pharmacological activity. This is
largely because the known assay techniques rely on
microscopic counting of the proportion of phagocytes
which have ingested apoptotic cells when exposed to
the test compound. However, the present assays
overcome this drawback because they can be performed
in the multi-well assay format and provide detection
systems which do not involve microscopy.

Thus, in accordance with a further aspect of the
invention there is provided a method of identifying a
compound which is an enhancer or inhibitor of
phagocytosis of apoptotic cells which comprises:

a) exposing a mammalian professional or
semi-professional phagocyte to an apoptotic mammalian cell
which has been stably transfected with a reporter gene
capable of generating a signal detectable without
microscopy, in the presence or absence of the compound
to be tested,

b) removing any apoptotic cells which are not
engulfed by said phagocytes and

c) detecting any signal of the reporter gene from
said phagocytes;

wherein any difference in signal in the presence of
said compound compared to the signal in the absence of
said compound is an indication that said compound is
an inhibitor or an enhancer of phagocytosis of
apoptotic cells.
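
The comparison of reporter signal in the presence and absence
of a compound lends itself to automated read-out in a
multi-well format. The following is a minimal sketch under
stated assumptions: replicate wells are averaged, and a
1.5-fold cut-off (a hypothetical value, not taken from this
specification) decides whether a difference is reported. The
function name and labels are likewise illustrative.

```python
from statistics import mean
from typing import Sequence


def classify_compound(signal_with: Sequence[float],
                      signal_without: Sequence[float],
                      fold_cutoff: float = 1.5) -> str:
    """Compare reporter signal with and without the test compound.

    signal_with / signal_without hold replicate well readings.
    fold_cutoff is an assumed example threshold.
    """
    control = mean(signal_without)
    if control <= 0:
        raise ValueError("control signal must be positive")
    ratio = mean(signal_with) / control
    if ratio >= fold_cutoff:
        return "enhancer"
    if ratio <= 1 / fold_cutoff:
        return "inhibitor"
    return "no effect detected"
```

In practice a statistical test across replicates would
normally replace the fixed fold cut-off used here.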

Usually, in the above method it is preferable to
incubate the phagocytic cells with the test compound
prior to addition of the apoptotic cells. A suitable
incubation time might be about 15 to 30 minutes.

Suitable phagocytic cells for carrying out the method
of the invention are mouse J774 cells or human THP-1
cells as described elsewhere herein. These cells are
monocyte cell-lines but are cultured so as to
differentiate them into macrophages prior to addition
of apoptotic cells.

The apoptotic cells for use in the above method may
be apoptotic neutrophils, apoptotic lymphocytes or
apoptotic erythrocytes. The apoptotic cells may
optionally be opsonised by exposure to serum.
Preferred apoptotic cells are the adherent cell-lines
[illegible] or [illegible] and, in particular, the
growth factor dependent mouse cell-line Ba/F3
described elsewhere herein.

The apoptotic cells are stably transfected with a
reporter gene of the types and using the methods
described above. One particular problem which can
arise with the use of reporter genes is that
expression of the gene in an apoptotic cell, which is
effectively dying, can be much less than in a fully
viable cell. If viable cells are present amongst the
apoptotic particles added to the phagocytes and there
is inadequate washing of the unphagocytosed particles,
the signal from the viable cells will mask any signal
from the apoptotic phagocytosed cells. Although it is
possible to ensure that adequate washing occurs, the
inventors have developed a particular embodiment where
this problem is avoided.

Specifically, this involves using a reporter gene
expressing a protein, preferably an enzyme, with a low
turnover in the cell such that the living cell and the
apoptotic particles have approximately the same
protein concentration or enzymatic activity. This
overcomes the drawbacks described above. Several
possible reporter proteins and substrates which may be
used have been described in the Handbook of
Fluorescent Probes and Research Chemicals, ed. by
R. P. Haugland (Molecular Probes, Eugene, OR, USA).
However, the inventors have found β-galactosidase
(lacZ) to be particularly suitable. The enzyme has a
relatively slow turnover and it is shown that the
cells and the apoptotic particles have relatively
equal amounts of activity. Furthermore, several
substrates exist for β-galactosidase (see Molecular
Probes, Eugene, OR, USA), from which the inventors
have used FDG mentioned above. This makes it possible
to develop a high throughput screen to select for
compounds that alter the phagocytosis of apoptotic
particles.

The phagocytes for use in the method of the invention
may be wild-type cells or they may be transgenic or
mutant cells. A mutant cell may have reduced or
increased phagocytic activity compared to wild-type.

A transgenic cell may be one stably transfected with a
gene which when expressed influences the rate of
phagocytic activity for apoptotic cells. For example,
the mammalian cells may be transfected with h1ced-6 or
h2ced-6 as described above, preferably using any of
the vectors mentioned herein in the description or
drawings.

In another embodiment in accordance with the invention
the phagocytes may be transfected with a DNA encoding
the cell surface antigen CD36. Expression of CD36 is
required for phagocytosis of apoptotic cells by human
macrophages that use either a phosphatidylserine
receptor or the vitronectin receptor (Fadok V.A. et
al, J Immunol 1998 Dec 1; 161(11):6250-7).
Transfection may be carried out by any of the methods
described herein and preferably using a vector
comprising a DNA sequence encoding CD36 as shown in
Figure 31 or the entire vector of Figure 31.






The methods of the invention are all performed in a
multi-well plate format and are therefore particularly
suitable for mid-to-high throughput screening. In a
preferred embodiment, the multi-well plates have 96
wells, but the invention is also applicable to
multi-well plates with another number of wells, which
include but are not restricted to plates with 6, 12,
24, 384, 864 or 1536 wells.
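
As an illustration of the plate formats listed above, the following minimal Python sketch maps a well index to a row/column label for each plate size. The row x column layouts are the common ones for these well counts and are an assumption here, as are the hypothetical names `PLATE_LAYOUTS`, `row_label` and `well_name`; none of them come from the specification.

```python
import string

# Assumed row x column layouts for the plate sizes named in the text.
PLATE_LAYOUTS = {
    6: (2, 3),
    12: (3, 4),
    24: (4, 6),
    96: (8, 12),
    384: (16, 24),
    864: (24, 36),
    1536: (32, 48),
}


def row_label(row_index):
    """Convert a zero-based row index to A..Z, then AA, AB, ..."""
    letters = string.ascii_uppercase
    if row_index < 26:
        return letters[row_index]
    return letters[row_index // 26 - 1] + letters[row_index % 26]


def well_name(well_index, n_wells=96):
    """Map a zero-based, row-major well index to a label such as 'A1'."""
    rows, cols = PLATE_LAYOUTS[n_wells]
    if not 0 <= well_index < rows * cols:
        raise ValueError("well index outside plate")
    row, col = divmod(well_index, cols)
    return f"{row_label(row)}{col + 1}"
```

For example, `well_name(95)` gives the last well of a 96-well plate and `well_name(383, 384)` the last well of a 384-well plate.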



All of the methods of the invention require the
detection of a signal which quantitates the
phagocytosis of apoptotic cells in the presence of the
compound under test. It is an essential feature of the
methods of this invention that this signal (also
referred to as the read-out) is detected using a
non-visual detection means. As used herein the term
'non-visual detection means' refers to any means of
detecting a signal which does not require visual
inspection by the human eye, including inspection
with a microscope. The use of a non-visual detection
system represents a major advantage over previously
known screening methods which require visual
inspection of the cells by eye in order to detect
uptake of apoptotic cells by phagocytes.



To allow for the non-visual detection of the apoptotic
cells in the high to mid-throughput screening in the
phagocytosis assay, the reporter gene must be capable
of generating a signal which is detectable by an
automatic plate reader, such as the Victor2 (Wallac,
Turku, Finland). An automatic plate reader which
detects a fluorescent signal is most preferred.

By generating a signal which can be read by an
automatic multiwell plate reader, quantitative
measurements can be made and this allows for the
assessment of the effect of many compounds at once, as
well as comparison of the effects between the
compounds.

It is further pointed out that the compounds to be
tested may be any of the types of compounds described
above and that where a particular compound results in
a reduced signal or no signal for the phagocytes, the
phagocytes will be tested for viability to rule out
non-specific toxicity of the compound in question.

In the non-limiting examples which follow, reference
is made to the following Figures:

FIGURE 1 shows the construction of the 2416 bp
consensus sequence which was obtained from EST, RACE
and colony hybridization (see Example 1). The sequence
was compiled by using aa159394 as template and primers
as indicated in the multiple alignment; rcc stands for
reverse complement. Both ced-6 and hced-6 are
indicated above the multiple alignment; pGA101 was
picked up by colony hybridization;

FIGURE 2 shows the consensus DNA sequence of h1ced-6
(2416 bp). Start and stop codons are in bold and
underlined. The alternatively spliced region is
underlined;

FIGURE 3 shows the alternatively spliced DNA sequence
of h2ced-6. Start and stop codons are in bold and
underlined;

FIGURE 4 shows the amino acid sequence of hCED-6.
Number of residues: 304; molecular weight: 34.4 kDa.
Again, the alternatively spliced region is underlined;

FIGURE 5 shows the amino acid sequence of h2CED-6, the
alternatively spliced version;

FIGURE 6 shows gel analysis of the nested PCR products
generated as described in Example 3; the lanes are
loaded as follows: (1) 100 bp marker, (2) primary
living neutrophils, (3) primary apoptotic neutrophils,
(4) primary macrophages, (5) primary macrophages
interacted with apoptotic neutrophils, (6) J774, (7)
COS-1, (8) THP-1;

FIGURE 7 shows the DNA sequence of the commercially
available Clontech vector pEGFP-N3 comprising the
reporter gene GFP as used in Examples 4 and 8;

FIGURE 8 shows a plasmid map of pEGFP-N3;

FIGURE 9 shows the DNA sequence of plasmid pGA3104, as
used in Examples 4 and 8, which comprises h1ced-6 in
the multicloning site of pEGFP-N3;

FIGURE 10 shows a plasmid map of pGA3104;

FIGURE 11 shows a DNA sequence of commercially
available plasmid pcDNA3.1/His/LacZ used for stable
transfection of Ba/F3 cells (see Example 4);

FIGURE 12 shows fluorescence intensity as a function
of transfected cell concentration when β-galactosidase
is reacted with the fluorogenic substrate fluorescein
di-β-D-galactopyranoside (FDG);

FIGURE 13 shows the effect of FDG concentration on
the read-out of the assay of Example 5;

FIGURE 14 shows the effect of incubation time on the
read-out of the assay of Example 5;

FIGURE 15 shows the effect of serum concentration in
medium of Ba/F3 cells on the assay of Example 5;

FIGURE 16 shows the location of the epitopes in
h1CED-6 used for generating polyclonal antibodies;

FIGURE 17 shows the DNA sequence of the plasmid
pGA1028 (pBAD/His A-hced-6) used in Example 7;

FIGURE 18 shows a plasmid map of pGA1028;

FIGURE 19 shows the DNA sequence of commercially
available Promega plasmid pGL2 which is suitable for
introduction of reporter gene luciferase into Ba/F3
cells;

FIGURES 20 to 25 show the results of the immunoblots
carried out in Example 7. In all these figures the
lanes are loaded as follows:

Lane 1: Prestained SDS-PAGE Standards Low Range
(Bio-Rad, Hercules, CA, USA),
Lane 2: pBAD/His A (Invitrogen, Leek, The
Netherlands),
Lane 3: pGA1028 (pBAD/HisA-h1CED-6);

FIGURE 20 shows gel stained with antibodies to the
epitope EP990044 as identified in Example 7 and
control antibodies Anti-Xpress Ab (Invitrogen, Leek,
The Netherlands) and Mouse Ig, horseradish
peroxidase-linked whole antibody (from sheep) (ECL
Western blotting detection reagents and analysis
system, Amersham Pharmacia Biotech, UK, England);

FIGURE 21 shows gel stained with antibodies to epitope
990044 and immune serum as described in Example 7;

FIGURE 22 shows gel stained with antibodies to epitope
990045 as identified in Example 7 and control
antibodies as described for Figure 20;

FIGURE 23 shows gel stained with antibodies to epitope
990045 and immune serum as described in Example 7;

FIGURE 24 shows gel stained with antibodies to epitope
990046 and with control antibodies as described for
Figure 20;

FIGURE 25 shows gel stained with antibodies to epitope
990046 and with immune serum as described in Example
7;

FIGURE 26 shows the DNA sequence of the commercially
available Clontech vector pEGFP-C2 comprising the
reporter gene GFP as used in Example 8;

FIGURE 27 shows a plasmid map of pEGFP-C2;

FIGURE 28 shows the DNA sequence of plasmid pGA3103 as
used in Example 8 which comprises h1ced-6 in the
multicloning site of pEGFP-C2;

FIGURE 29 shows a plasmid map of pGA3103;

FIGURE 30 shows Western blot of cell lysates from
COS-1 cells transfected with MOCK (negative control
for transfection), pEGFP-N3, pGA3103 and pGA3104;
control lysates for actual co-immunoprecipitation from
Ba/F3 cells incubated with or without the first
antibody; positive control lysates from EGF-stimulated
A431 cell lysates for anti-phosphotyrosine antibody.
Blot A was probed with a mouse monoclonal IgG2 which
detects tyrosine-phosphorylated proteins in cell
lysates; blot B was probed with polyclonal antibody
which reacts with green fluorescent protein; and blot
C was probed with rabbit antiserum to h1CED-6. Mw of
h1CED-6 is 34435.39; Mw of GFP is 26886.32; and Mw of
the fusion protein GFP-CED-6 or CED-6-GFP is 62385.95;

FIGURE 31 shows the DNA sequence of plasmid pGA1058
which comprises a DNA sequence encoding the cell
surface receptor CD36 inserted in the multicloning
site of pEGFP-N3;

FIGURE 32 shows a plasmid map of pGA1058;

FIGURE 33 shows the percentage annexin and propidium
iodide positive cells in a cell population of Ba/F3
cells as a function of time after withdrawal of IL-3;

FIGURE 34 shows the effect of temperature on FDG
incubation. Live and apoptotic Ba/F3 cells were added
to macrophage cell-line J774. After the phagocytosis
assay, FDG (10 µM) was incubated for 1 h at 4°C, 20°C
and for 10 and 20 min at 37°C.

EXAMPLE 1 

Extensive searches (tblastn) with the ced-6 sequence
(Figure 2 Consensus DNA Sequence of hced-6) against
the public domain databases (EST, Genbank, EMBL,
Swissprot and PIR) revealed statistically significant
homologies to some ESTs at the carboxyterminal region
of the protein (AA443368, AA431995, R33389, R53881).
One EST (T48513) showed homology to the
carboxyterminal of the PTB domain and the beginning of
the charged region. For 5' RACE analysis a
Marathon-ready cDNA colorectal adenocarcinoma library
from Clontech was used. The position of the primers
used for RACE and sequencing is indicated in Figure 1.
By subsequent cloning and sequence analysis additional
sequence information was obtained. Using this
additional sequence information, subsequent rounds
of database searching (blastn) revealed additional
ESTs, which enabled us to construct a consensus of
approx. 2400 bp. This sequence was further extended
and verified by colony hybridization and sequencing
additional RACE products.

EXAMPLE 2 
RNA Blots: 

A human multiple tissue Northern (MTN-I, Clontech)
containing in each lane 2 µg of poly A+ RNA from
eight different human tissues (heart, brain, placenta,
lung, liver, skeletal muscle, kidney, and pancreas)
and an MTN-II human multiple tissue Northern,
containing in each lane 2 µg of poly A+ RNA from
spleen, thymus, prostate, testis, ovary, small
intestine, colon and peripheral leukocyte, were
hybridized according to the manufacturer's
instructions and washed in 0.1 x SSC, 0.2% SDS at
55°C. Also from Clontech, a poly A+ RNA blot from
human cancer cell lines (melanoma G361, lung carcinoma
A549, colorectal adenocarcinoma SW480, Burkitt's
lymphoma Raji, lymphoblastic leukemia Molt-4, chronic
myelogenous leukemia K562, HeLa S3 and promyelocytic
leukemia HL60) was tested.

The expression pattern of hCED-6 in normal human
tissues and cancer cell lines by Northern blotting is
shown in Table I below:

A) Human Multiple Tissue Northern (MTN) Blot; B) Human
Multiple Tissue Northern (MTN) Blot II; C) Human Cancer
Cell Line Multiple Tissue Northern (MTN™) Blot.

TABLE I

(The alignment of the expression and length values with
the individual lanes is not preserved in this copy;
values are listed in source order.)

A)
Lanes: heart | brain | placenta | lung | liver | skeletal muscle | kidney | pancreas
Expression level: + | +++ | + | ++ | + | +
Length (kb): 3.6 | 3.6 | 3.6 | 3.9 | 3.6 | 3.6

B)
Lanes: spleen | thymus | prostate | testis | ovary | small intestine | colon (mucosal lining) | peripheral blood leukocyte
Expression level: + | [illegible] | ++ | + | + | +
Length (kb): 3.6 | 3.6 | 3.9 | 3.6 | 3.6 | 3.6

C)
Lanes: chronic myelogenous leukemia K-562 | Burkitt's lymphoma Raji | promyelocytic leukemia HL-60 | HeLa cell S3 | lymphoblastic leukemia MOLT-4 | colorectal adenocarcinoma SW480 | lung carcinoma A549 | melanoma G361
Expression level: ++ | [illegible] | +++ | +++ | +
Length (kb): 3.6 | 3.6 | 3.6 | 3.5 | 3.6

EXAMPLE 3

Detection of CED-6 (h1CED-6) and its splice
variant (h2CED-6) in phagocytic cell lines.



Cell line THP-1 (ATCC no: TIB-202), a human monocyte
cell line that can be differentiated into a macrophage
cell with PMA (Sigma-Aldrich, St. Louis, MO, USA), was
cultivated under standard conditions in RPMI 1640
medium containing 2 mM L-glutamine, 1.5 g/L sodium
bicarbonate, 4.5 g/L glucose, 10 mM HEPES, 1 mM sodium
pyruvate, 0.05 mM β-mercaptoethanol (all purchased
from GibcoBRL, Life Technologies, Merelbeke, Belgium).

RNA was isolated from this cell line using the
RNeasy mini kit from Qiagen (Westburg, Leusden, the
Netherlands), according to the instructions of the
manufacturer, or with minor modifications thereof.

From this RNA, first strand cDNA was generated using
the Ready-To-Go T-primed First-strand kit from
Pharmacia Biotech (Piscataway, NJ, USA), according to
the instructions of the manufacturer, or with minor
modifications thereof.

The generated cDNA was used in a PCR protocol to
generate DNA fragments using primers:

oGA131: 5'-CGCAAGGATCCCCATGAACCGTGCTTTTAGCAGGAAG-3'

445-10934-13R: 5'-GATCTACTAGGTACTGGAG-3'

PCR was performed with the TaKaRa Ex Taq kit (Takara
Shuzo Co., Ltd, Shiga, Japan) according to the
instructions of the manufacturer, or with minor
modifications. Plasmid pGA1025, harboring the h1ced-6
gene, was used as positive control.

In summary:

PCR on the first-strand cDNA isolated from the
Ready-To-Go T-primed First-strand kit contained the
entire cDNA as made, 0.4 µl oGA131 (100 pmol/µl),
0.4 µl 445-10934-13R (100 pmol/µl), 0.5 µl ExTaq
5 U/µl, 65.7 µl water.

PCR on the positive control contained 10 µl buffer
ExTaq 10x, 10 µl dNTP mix ExTaq 10x, 0.4 µl oGA131
(100 pmol/µl), 0.4 µl 445-10934-13R (100 pmol/µl),
2 µl pGA1025, 76.2 µl water.
PCR program:

95°C 1'   }
45°C 30"  } 40x
72°C 30"  }
72°C 7'
4°C ∞
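
The hold times of the program above can be totalled with a short calculation. This is a minimal Python sketch which assumes that the 40x bracket spans the 95°C, 45°C and 72°C steps, and which ignores ramp times and the final 4°C hold; `program_seconds` is a hypothetical helper name, not part of any instrument software.

```python
def program_seconds(initial_steps, cycle_steps, n_cycles, final_steps):
    """Sum the hold times (in seconds) of a thermocycler program."""
    return sum(initial_steps) + n_cycles * sum(cycle_steps) + sum(final_steps)


# First-round PCR: 40 cycles of 95°C 1', 45°C 30", 72°C 30", then 72°C 7'.
first_round_s = program_seconds(
    initial_steps=[],
    cycle_steps=[60, 30, 30],
    n_cycles=40,
    final_steps=[7 * 60],
)
first_round_min = first_round_s / 60
```

Under these assumptions the first-round program holds for 5220 seconds in total, so the actual run is somewhat longer once ramping is included.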

2 µl of each PCR reaction and 1 µl from the positive
control PCR reaction was used to perform a nested PCR,
with the following primers:

oGA131: see above

oGA141: 5'-GCGGATGGTACCGTCGACTGCTGATACTTGAGTTATTCTCAG-3'

PCR was performed with the TaKaRa Ex Taq kit (Takara
Shuzo Co., Ltd, Shiga, Japan) according to the
instructions of the manufacturer, or with minor
modifications.

The mastermix: 5 µl buffer ExTaq 10x, 5 µl dNTP mix
ExTaq 10x, 0.2 µl oGA131 (100 pmol/µl), 0.2 µl oGA141
(100 pmol/µl), 0.5 µl ExTaq 5 U/µl, 37.1 µl water.

Program:

94°C 4'
94°C 1'   }
45°C 30"  } 35x
72°C 30"  }
72°C 7'
4°C ∞

10 µl nested-PCR product was analyzed on gel using
standard protocols (Molecular Cloning, a laboratory
manual, Sambrook et al, 1989, CSHL press).
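
As a check on the nested-PCR mastermix volumes given above, the components can be summed and scaled for several reactions. The sketch below is a hedged illustration in Python: the component volumes are those stated in the text, while the 10% pipetting overage and the names `NESTED_MIX_UL` and `scale_mix` are assumptions introduced here.

```python
from decimal import Decimal

# Per-reaction volumes (µl) of the nested-PCR mastermix, as stated in the text.
NESTED_MIX_UL = {
    "10x ExTaq buffer": Decimal("5"),
    "10x dNTP mix": Decimal("5"),
    "oGA131 (100 pmol/ul)": Decimal("0.2"),
    "oGA141 (100 pmol/ul)": Decimal("0.2"),
    "ExTaq (5 U/ul)": Decimal("0.5"),
    "water": Decimal("37.1"),
}


def scale_mix(mix, n_reactions, overage=Decimal("0.1")):
    """Scale per-reaction volumes to n reactions plus a fractional overage."""
    factor = Decimal(n_reactions) * (1 + overage)
    return {name: volume * factor for name, volume in mix.items()}


per_reaction_ul = sum(NESTED_MIX_UL.values())
```

The mastermix comes to 48 µl per reaction, so adding the 2 µl of first-round product gives a 50 µl reaction (49 µl for the positive control, which receives 1 µl).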

The above procedure was repeated for primary living
neutrophils, primary apoptotic neutrophils, primary
macrophages, primary macrophages interacted with
apoptotic neutrophils, mouse monocyte cell-line J774
and COS-1 cells. The results are shown in Figure 6.
Remarkably, only in cell-line THP-1 could h1ced-6 and
its splice variant h2ced-6 be detected (see lane 8 of
Figure 6).

EXAMPLE 4

Stable cell lines of human CED-6

Cells of the J774 murine monocyte tumour cell line
(Morland and Kaplan, 1978, Exp. Cell Res. 115:53-61;
Morland and Kaplan, 1978, Exp. Cell Res. 115:63-72;
Accession No ATCC TIB-67, also described as J774A.1),
cultivated in [illegible] with glutamax I and 10%
myoclone serum (all from GIBCOBRL, Life Technologies,
Merelbeke, Belgium), were transfected by
electroporation with the plasmids:

pEGFP-N3 (Clontech, Palo Alto, CA, USA) (Figures 7 and
8): mock transfection,
pGA3104: h1CED-6/GFP fusion (Figures 9 and 10),
Salmon sperm DNA: negative control.



Electroporation was performed with the Easyject Plus
electroporator system from Equibio Ltd (Immunosource,
Halle-Zoersel, Belgium), using the following protocol:
3 x 10[exponent illegible] cells were placed in 800 µl
cell culture medium, and 30 µg DNA was added.

The settings of the Easyject Plus electroporator were:
double pulse:

Voltage I = 750 V, Capacitor I = 25 µF, Resistance I =
99 R, Interpulse delay = 0
Voltage II = 150 V, Capacitor II = 1500 µF, Resistance
II = 99 R, Optipulse option

3200 µl of electroporated cells per construct were
seeded into a 175 cm^2 culture flask, and selected
with G418 antibiotic (400 µg/ml) (Duchefa, Haarlem,
The Netherlands) after 72 h. Subclones of clones were
obtained and checked for GFP expression.

EXAMPLE 5

Phagocytosis assay

Preparing the phagocytes

Monocyte cell line J774 stably transfected with
pEGFP-N3, pGA3104 or pGA1058 was plated at a
density of 1 x 10^5 in a black 96-well plate for 36 h
at 37°C and 5% CO2 in advance of performing the
phagocytosis assay.

Preparing the apoptotic cells

Growth factor dependent cell line Ba/F3 stably
transfected with β-galactosidase as reporter gene is
used as source of apoptotic cells. A suitable plasmid
for transfecting Ba/F3 is pcDNA3.1/His/LacZ as shown
in Figure 11. Ba/F3 cells, which are IL-3 dependent
mouse clones (Palacios and Steinmetz, 1985, Cell
41:727-734; Palacios et al, 1984, Nature 309:126-131),
were grown in DMEM with glutamax I, 10% FCS, 1%
antibiotics (all from GIBCOBRL, ibid.), and 10%
supernatant from WEHI-3 culture.
WEHI-3 (ATCC no.: TIB-68) produces IL-3 when grown in
culture medium: RPMI 1640 with glutamax I, 10% FCS,
3.6 µl β-mercaptoethanol per 1 litre.
Ba/F3 cells were split [ratio illegible] two days
before the interaction assay (exponential growth
phase) and Ba/F3 cells (5 x 10^[illegible]/ml) were
cultured without growth factor IL-3 for 20 h in
advance of performing the assay.
Apoptotic Ba/F3 cells were monitored by the
annexin/propidium iodide labeling kit from
Boehringer-Mannheim (Brussels, Belgium). Ba/F3 cells
are early apoptotic if 20% are annexin positive and
less than 5% are propidium iodide positive. Ba/F3
cells cultured with growth factor IL-3 were used as a
negative control. Results of the annexin/propidium
iodide test are shown in Figure 33.
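
The early-apoptosis criterion can be written as a simple threshold check. This Python sketch is illustrative only: it assumes the criterion means at least 20% annexin-positive cells together with fewer than 5% propidium iodide-positive cells, and the function and constant names are hypothetical.

```python
ANNEXIN_MIN_PCT = 20.0  # assumed lower bound for annexin-positive cells
PI_MAX_PCT = 5.0        # assumed upper bound for propidium iodide-positive cells


def is_early_apoptotic(annexin_positive_pct, pi_positive_pct):
    """Return True if a population meets the assumed early-apoptosis criterion."""
    return (annexin_positive_pct >= ANNEXIN_MIN_PCT
            and pi_positive_pct < PI_MAX_PCT)
```

A population with 25% annexin-positive and 3% propidium iodide-positive cells would pass this check, whereas one with 12% propidium iodide-positive cells would be treated as too far advanced.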

Adding the apoptotic cells to the phagocytes

100 µl Ba/F3 cells (1 x 10^7 cells/ml) were added to
wells containing stably transfected J774 and to the
negative control wells and incubated at 37°C for
periods ranging from 20 minutes to 5 hours, after
which the apoptotic cells were removed from all wells.
Phagocytes were washed three times in PBS buffer
(GIBCOBRL, ibid.), being careful not to dislodge any
of the cells.
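
The number of apoptotic cells offered per phagocyte follows from the volumes and densities given above. The Python sketch below works through that arithmetic, assuming that the plating density of 1 x 10^5 refers to cells per well and that the number of J774 cells has not changed by the time of the assay; both function names are hypothetical.

```python
def cells_added(volume_ul, cells_per_ml):
    """Number of cells contained in volume_ul of a suspension (integer inputs)."""
    return volume_ul * cells_per_ml // 1000


def target_to_phagocyte_ratio(volume_ul, cells_per_ml, phagocytes_per_well):
    """Apoptotic cells added per plated phagocyte."""
    return cells_added(volume_ul, cells_per_ml) / phagocytes_per_well


ratio = target_to_phagocyte_ratio(100, 10**7, 10**5)
```

With 100 µl at 1 x 10^7 cells/ml, each well receives 1 x 10^6 Ba/F3 cells, which under the stated assumptions is about ten apoptotic cells per phagocyte.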



Read-out:

Phagocytosis by J774 cells of the apoptotic bodies
was measured by detecting the β-galactosidase as
expressed in the Ba/F3 cells. Detection was performed
using a fluorogenic substrate, fluorescein
di-β-D-galactopyranoside (FDG) (Molecular Probes,
Eugene, OR, USA). 10 µM FDG was added to the wells and
incubated for 1 h at room temperature in the dark. FDG
is sequentially hydrolysed to FMG and fluorescein by
the activity of the β-galactosidase, and the green
fluorescence was read using 480 nm excitation, 520 nm
emission and the appropriate sensitivity settings.
For calibration purposes fluorescent read-out was
determined for different Ba/F3 concentrations. The
fluorescent read-out as a function of cell
concentration is shown in Figure 12.
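
A calibration of this kind can be used to convert a fluorescence reading back into an approximate cell number. The following Python sketch fits a straight line by ordinary least squares and inverts it. It assumes the read-out is linear over the range used, which the text does not state, and the data points are invented for illustration only; they are not taken from Figure 12.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def cells_from_signal(signal, slope, intercept):
    """Invert the calibration line to estimate cells per well."""
    return (signal - intercept) / slope


# Hypothetical calibration points: cells per well and fluorescence units.
cells = [0, 25000, 50000, 100000]
fluorescence = [200, 1450, 2700, 5200]
slope, intercept = fit_line(cells, fluorescence)
```

The intercept in such a fit corresponds to the background fluorescence of wells without β-galactosidase-expressing cells.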
Further experiments were carried out varying the
concentration of FDG, the incubation time of the
assay, the concentration of serum in the Ba/F3 medium
and the temperature of the assay. The results are
shown in Figures 13, 14, 15 and 34 respectively.

Compound Screening 

The above-described phagocytosis assay is used to
screen compounds for their ability to influence the
level of phagocytosis of apoptotic cells by
professional phagocytes. The test compound or
compounds can be added to the test wells approximately
30 minutes before addition of the Ba/F3 cells to the
wells. Because of the multiwell format of the assay
and automatic readout of fluorescence using standard
equipment, the assay is ideally suited to
high-throughput compound screening.

It will be understood that for any compound whose
presence results in no fluorescence or reduced
fluorescence compared to phagocytic cells not
exposed to the compound, the cells in that well will
be tested for viability using commercially available
reagents such as the Live/Dead Viability/Cytotoxicity
Kit from Molecular Probes (Eugene, USA). This kit
provides a two-colour fluorescence cell viability
assay that is based on the simultaneous determination
of live and dead cells with two probes, calcein AM and
ethidium homodimer, that measure two recognised
parameters of cell viability, intracellular esterase
activity and plasma membrane integrity, respectively.
This kit is suitable for use with fluorescence
multiwell plate scanners. The presence of viable cells
will confirm that the lack of fluorescence is due to
the effect of the compound on phagocytic activity and
not just non-specific toxicity.

Furthermore, any compound identified as an inhibitor
or an enhancer of phagocytosis of apoptotic cells by
the assay described above will be further tested to
confirm that the effect is mediated through CED-6. In
the case of a compound identified as an enhancer of
phagocytosis of apoptotic cells this can be achieved
by carrying out a phagocytosis assay exactly as
described above with J774 cells which are not
transfected with h1ced-6 or h2ced-6. If the compound
is able to induce a phenotype in the J774 cells which
is similar to the phenotype of those cells when
transfected with hced-6 then it is an indication that
the compound in question exerts its effect via CED-6
or via the CED-6 signal transduction pathway.

Similarly, if a compound identified in the
above-described assay is an inhibitor of phagocytosis
of apoptotic cells it can be confirmed whether its
effect is via CED-6 or the CED-6 signal transduction
pathway by examining the phenotype of the transfected
J774 cells exposed to the compound. Reversion to a
wild-type phenotype is an indication of action via
the CED-6 signal transduction pathway.
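
The screening logic of this example (compare the signal with and without compound, then check viability where the signal is reduced) can be summarised in a short decision function. This is a minimal Python sketch under stated assumptions: the text treats any difference in signal as indicative, so the 20% threshold, the category labels and the name `classify_compound` are all hypothetical choices made here for illustration.

```python
def classify_compound(signal_with, signal_without, viable=None, threshold=0.2):
    """Classify a compound from plate-reader signals.

    signal_with / signal_without: fluorescence with and without compound.
    viable: result of the viability test, or None if not yet performed.
    threshold: assumed minimum relative change treated as an effect.
    """
    if signal_without <= 0:
        raise ValueError("control signal must be positive")
    change = (signal_with - signal_without) / signal_without
    if change >= threshold:
        return "enhancer"
    if change <= -threshold:
        if viable is None:
            return "inhibitor (viability check pending)"
        return "inhibitor" if viable else "toxic"
    return "no effect"
```

A well reading 400 units against a control of 1000 would be called an inhibitor only once the phagocytes in that well have been shown to be viable.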



EXAMPLE 6

Polyclonal antibodies directed to human CED-6

Polyclonal antibodies were raised in rabbits against
the following ced-6 epitopes:

EP990044 H2N - NRA FSR KKD KTC - CONH2
EP990045 H2N - CFL GST EVE QPK GTE - CONH2
EP990046 H2N - CTR NGT QPP PVP SRS T - CONH2

Location of these epitopes in the ced-6 protein is
shown in Figure 16.

The polyclonals were raised by Eurogentec Bel,
Herstal, Belgium, using the following protocol:

Day 0 : taking of pre-immune serum followed by the
first immunisation
Day 14 : second immunisation
Day 28 : third immunisation
Day 38 : blood sampling (shipping 2 ml)
Day 56 : fourth immunisation
Day 66 : blood sampling (shipping 2 ml + 20 ml)
Day 80 : complete bleeding.

EXAMPLE 7 

Testing Antibodies by Western Blot

Transformation of plasmid pGA1028 (pBAD/His A-hCED-6)
(see Figures 17 and 18) was done in TOP10 competent
cells. Furthermore, pBAD/His was transformed in the
same E. coli cells as a negative control. pBAD/HisA
and E. coli TOP10 were purchased from Invitrogen
(Leek, The Netherlands).

The pBAD vector expression system was used, as it is
known to be an efficient expression system. In the
presence of arabinose, expression from pBAD is turned
on while the absence of arabinose produces very low
levels of transcription from pBAD. A pilot expression
was carried out according to the instructions of the
manufacturer, in which the amount of arabinose was
varied to determine the approximate amount of
arabinose needed for maximum expression of the
protein. The protocol according to the manufacturer
(Invitrogen, Leek, The Netherlands) was used.
Expression was scaled up using the same protocol.

Purification of protein was performed from the E. coli
cells transformed with pGA1028: 5 ml lysis buffer (10
ml TE 1x pH 8, 0.5 mg/ml lysozyme, 0.1 mg/ml DNAse, 100
µl 1 M CaCl2, 400 µl protease inhibitor 25x) was added
to the pellet of a 50 ml expression induced culture,
and the pellet was resuspended in this lysis buffer.
The suspension was placed for 30 min on ice and
sonicated 3 times for 5" (high density), after which
the suspension was treated with 3 cycles of
freeze-thaw (liquid nitrogen to 42°C), and placed for
30 min at 37°C. The suspension was centrifuged for 5'
at maximal speed. The pellet, which contains the
insoluble fraction and also the hCED-6 fusion protein,
was resuspended in 1 ml 2 M urea and shaken for 5' at
1200 rpm. This suspension was centrifuged for 5' at
maximal speed, and the supernatant was used for gel
electrophoresis and Western blotting. 25 µl
supernatant and 25 µl premixed Laemmli Sample Buffer
(Bio-Rad, Hercules, CA, USA) were mixed.

Proteins from the negative control were not purified.
E. coli transformed with pBAD/His were prepared by
pelleting 1 ml of induced E. coli culture, and
resuspending the pellet in 1 ml of premixed Laemmli
Sample Buffer (Bio-Rad, Hercules, CA, USA). As such,
this can be used for PAGE gel electrophoresis.

Preparation of samples to load on a gel:
Both the samples were boiled for 5' and placed on ice
prior to loading. 25 µl samples were loaded on a
Ready Gel, 50 µl well, Tris-HCl, 4-15% (Bio-Rad,
Hercules, CA, USA) and electrophoresis was performed
according to the manufacturer's instructions. The
proteins of the gel were transferred onto a
nitrocellulose membrane (Trans-Blot Transfer medium,
Bio-Rad, Hercules, CA, USA) with a Mini Trans-Blot
electrophoresis cell (Bio-Rad, Hercules, CA, USA)
according to the instructions of the manufacturer.

Western Blot was performed according to the
instructions of the providers of the antibodies and
the detection kit.
As first antibody in the western blots, immune serum
or pre-immune serum of the rabbits was used in a
dilution of 1/2000 in PBST (1.44 g/L KH2PO4, 90 g/L
NaCl, 7.75 g/L Na2HPO4·7H2O, 0.1% Tween).
The second antibody was anti-rabbit, horseradish
peroxidase-linked whole antibody (from donkey) diluted
1/4000 in PBST (0.1%) (ECL Western blotting detection
reagents and analysis system, Amersham Pharmacia
Biotech, UK, England).
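
The antibody dilutions quoted in this example translate into small pipetting volumes. As a minimal sketch in Python, the helper below computes the volume of serum or antibody stock for a given final volume; the 10 ml working volume used in the usage note is an assumption for illustration, and `serum_volume_ul` is a hypothetical name.

```python
def serum_volume_ul(final_volume_ml, dilution_factor):
    """Volume of stock (µl) needed for a 1/dilution_factor dilution."""
    return final_volume_ml * 1000 / dilution_factor
```

For an assumed 10 ml of PBST, `serum_volume_ul(10, 2000)` gives 5 µl of serum for the 1/2000 dilution and `serum_volume_ul(10, 4000)` gives 2.5 µl of second antibody for the 1/4000 dilution.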

All incubations were performed according to the
manufacturer's instructions, with the following
modification: the first antibody was incubated
overnight in PBST instead of TBST. Detection of the
antibodies was done according to the manufacturer's
instructions (ECL Western blotting detection reagents
and analysis system, Amersham Pharmacia Biotech, UK,
England). Stripping and reprobing of membranes after
ECL detection was also done according to the
manufacturer's instructions for the same system.

Western blotting with control antibodies was done
according to the manufacturer's instructions for the
anti-HisA antibodies (Invitrogen, Leek, The Netherlands).
The antibody designated Anti-Xpress antibody, diluted
1/5000 in PBST (0.1%) (Invitrogen, Leek, The
Netherlands), was used as first antibody.
Second antibody: mouse Ig, horseradish
peroxidase-linked whole antibody (from sheep) diluted
1/4000 in PBST (0.1%) (ECL Western blotting detection
reagents and analysis system, Amersham Pharmacia
Biotech, UK, England).

The staining of the antibodies in both experiments
was performed according to the instructions of the
manufacturer (ECL Western blotting detection reagents
and analysis system, Amersham Pharmacia Biotech, UK,
England).

The results are shown in Figures 20 to 25. 

EXAMPLE 8

Interaction of H-CED-6 with Phosphorylated Tyrosine
Proteins

Co-immunoprecipitation

The antibodies raised against the three epitopes of
hlCED-6 were used in western blotting to detect CED-6
interactions with phosphorylated tyrosine proteins and
to identify CED-6 interacting proteins.

Transfection 

Transfection of COS-1 cells was performed with
plasmids pGA3103 (see Figures 26 to 29) and pGA3104
(see Figures 7 to 10) in a 175 cm² flask (1 x 10⁷
cells). As negative controls, MOCK and pEGFP-N3 were
used. Full-length human ced-6 (in frame with GFP,
both N- and C-terminal fusions, internal control) was
investigated. COS-1 cells were transfected with
Lipofectamine Plus reagent (GIBCO-BRL). The protocol
from Life Technologies was followed and the volumes
that were used are shown in Table 2.

MOCK-transfected cells are a negative control for
transfection. In place of adding DNA, only the solvent
of the DNA is added to the cells. The solvent of the
DNA is TE buffer, pH 8: 1 M Tris (ICN) pH 8 and 0.5 M
EDTA (Merck-Belgolabo) pH 8 in H2O.

TABLE 2

Lipofectamine transfection (Life Technologies) of COS-1 cells in a 175 cm² flask

Culture  Construct  Conc.  DNA  DNA      Optimem  Plus     Optimem  Lipofect-  Optimem  Optimem
flask               DNA                           reagent           amine
                                (µl)     (µl)     (µl)     (µl)     (µl)       (µl)     (ml)

175 cm²  MOCK       -      -    12 (TE)  1125     60       1125     90         2250     15
175 cm²  pEGFP-N3   1      12   12       1125     60       1125     90         2250     15
175 cm²  pGA3103    1      12   12       1125     60       1125     90         2250     15
175 cm²  pGA3104    1      12   12       1125     60       1125     90         2250     15

As a positive control for phosphorylation, the β-chain
of the IL-3 receptor of Ba/F3 cells, which is
phosphorylated, was used.

COS-1 cell lysates were prepared using digitonin (as
gently as possible, so as not to disturb the
interaction), with phosphatase inhibitors and protease
inhibitors (preferably cocktail pills or Pefabloc)
added to the lysis buffer.

Low-stringency digitonin-based buffer:
Buffer with digitonin in 10 ml bidistilled water

• 1% digitonin (Serva 19551, MW 1229.3)
  2% stock = 250 mg in 12.5 ml

• 10 mM triethanolamine pH 7.8 (Sigma-Aldrich,
  Bornem, Belgium)
  10X stock 100 mM = 185.7 mg in 10 ml, pH 7.8
  (5 ml in 50 ml)

• 0.15 M NaCl (MW 58.44)
  87.66 mg per 10 ml

• 2 mM Na3VO4 (Sigma-Aldrich, Bornem, Belgium)
  3.687 mg per 10 ml

• 2 mM EDTA (Titriplex III; MW 372.24)
  (Darmstadt, Germany)
  7.444 mg per 10 ml

• 200 U/ml aprotinin - Trasylol (Sigma-Aldrich,
  Bornem, Belgium)
  1 mg = 11 TIU = 9900 KU
  200X stock: 10 mg in 2 ml PBS = 49500 KU (50 µl
  in 10 ml)

• 1 mM Pefabloc (Merck, Darmstadt, Germany)
  2.4 mg per 10 ml
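
The per-10-ml weights in the recipe above follow from mass = molarity x molecular weight x volume. The short sketch below checks that arithmetic for the two components whose molecular weights are stated in the text (NaCl and EDTA). The function and dictionary names are illustrative assumptions, not part of the original protocol.

```python
def mass_mg(molarity_mol_per_l: float, mw_g_per_mol: float, volume_ml: float) -> float:
    """Mass of solute in mg: (mol/L) x (g/mol) x (L), converted from g to mg."""
    volume_l = volume_ml / 1000.0
    return molarity_mol_per_l * mw_g_per_mol * volume_l * 1000.0


# (molarity in mol/L, molecular weight in g/mol), both taken from the recipe text.
COMPONENTS = {
    "NaCl": (0.15, 58.44),
    "EDTA (Titriplex III)": (0.002, 372.24),
}

if __name__ == "__main__":
    buffer_volume_ml = 10.0  # buffer volume stated in the recipe
    for name, (molarity, mw) in COMPONENTS.items():
        mg = mass_mg(molarity, mw, buffer_volume_ml)
        print(f"{name}: {mg:.3f} mg per {buffer_volume_ml:g} ml")
```

For NaCl this reproduces the 87.66 mg quoted in the recipe; for EDTA it gives a value within a fraction of a microgram of the quoted 7.444 mg.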

Lysis of cells

• Transfected cells were washed 2 x in PBS Dulbecco's
  (GIBCO-BRL) in a Falcon tube.

• Cells were scraped and the pellet resuspended in
  300 µl lysis buffer.

• All manipulations were carried out at 4°C.

• The preparation was centrifuged at 4000 rpm and the
  supernatant transferred to a new tube.

Preclearance

• Protein G sepharose CL-4B beads (Amersham
  Pharmacia, Roosendaal, the Netherlands) were
  supplied freeze dried in the presence of additives.
  These additives were washed away at neutral pH and
  ethanol replaced with lysis buffer.

• 50% v/v Protein G sepharose suspension:
  1 ml 50/50 v/v Protein G sepharose was pipetted and
  centrifuged at high speed for 5 sec. It was then
  aspirated and resuspended in an equal volume of lysis
  buffer. Washing was repeated three times.

• To 300 µl of lysate was added 50 µl of Protein G
  sepharose CL-4B beads (Amersham Pharmacia, ibid.)
  and this was reacted for 1 hour.

• It was then centrifuged for 10 sec at 14 000 rpm and
  4°C and the supernatant transferred to a new tube.

Co-immunoprecipitation

• COS-1 lysates: 5 µl anti-green fluorescent protein
  (GFP) polyclonal rabbit antibody (Immuno source,
  Halle-Zoersel, Belgium) was added.
  The lyophilized form was dissolved in 100 µl distilled
  water, then frozen at -20°C.

• Ba/F3 lysate number 5: 5 µl of rat antibody to the
  β-chain of the IL-3 receptor (Van der Heyden J., Devos
  R., Plaetinck G., Fache I., Fiers W. & Tavernier
  J. 1991. Characterization of the murine IL-5
  receptor complex with the use of a panel of
  monoclonal antibodies. Relationship to the
  murine IL-3 receptor. J Immunol. 147:3413-3418)
  was added.

• Ba/F3 lysate number 6: no antibody was added.

• Samples were incubated between 4 h and 24 h
  (overnight) at 4°C, rotating.

• 50 µl protein A beads were added and incubated for
  1 hour at 4°C.

• Samples were centrifuged for 3 min at 3000 rpm
  (4°C).

• Beads were resuspended in 800 µl lysis buffer,
  inverted several times or rotated for a few minutes
  and centrifuged at 3000 rpm for 3 minutes (4°C).
  This was repeated three times and on the last
  occasion the wash buffer was removed with a
  capillary tip.

• Beads were suspended in 20 µl SDS loading buffer
  (with β-mercaptoethanol).

• Lysate number 7 = EGF-stimulated A431 cell lysate
  (positive control for anti-phosphotyrosine)
  (Upstate Biotechnology, cat. no. 12-302).

• 2.5 µl of β-mercaptoethanol was added to 100 µl of
  lysate and samples boiled.

• Samples were centrifuged for 3 min at 3000 rpm
  (4°C) and the supernatant (SN) was loaded on SDS-PAGE
  using prestained SDS-PAGE standards, low range, from
  Bio-Rad (cat. no. 161-0305).



Western-blotting

• Cell lysates were transferred to nitrocellulose
  with:

• Transfer buffer = 48 mM Tris, 39 mM glycine, 20%
  methanol, pH 9.2 (5.82 g Tris, 2.93 g glycine in
  H2O, 200 ml methanol, to 1 L with H2O)

• Blocking buffer: 1x PBS, 0.1% Tween, 5% milk
  powder; incubate blot overnight

• Gel was probed with anti-phosphotyrosine (cat.
  05-321, Upstate Biotechnology): 1 µg/ml for 3 h and
  washed twice with PBS, 0.1% Tween for 5 min

• Proteins were visualized using 1:4000 goat
  anti-mouse horseradish peroxidase (cat. no. RPN 2108,
  Amersham Pharmacia Biotech) as second Ab for 1 h
  at RT.

• Blots were washed twice with PBS, 0.1% Tween for 5
  min, twice with blocking buffer for 5 min and twice
  with H2O (5 min).

ECL Western blotting analysis system

ECL Western blotting detection reagents from Amersham
Pharmacia Biotech (cat. no. RPN 2108) were used.

Stripping and reprobing blot after ECL detection kit

The membranes were stripped of bound antibodies and
reprobed. Membranes were stored wet in Saran wrap at
4°C after each immunodetection.

• The membrane was submerged in stripping buffer (100
  mM β-mercaptoethanol, 2% SDS, 62.5 mM Tris-HCl pH
  6.7) and incubated at 50°C for 30 min with
  occasional agitation.

• The membrane was washed for 2 x 10 min in PBS, 0.1%
  Tween at room temperature using large volumes of
  wash buffer.

• The membrane was blocked by immersing in blocking
  buffer for 1 h at RT.

• Immunodetection was performed with anti-green
  fluorescent protein (GFP) and repeated with
  anti-human CED-6.
Results 

Western blots of all cell lysates probed either with
anti-phosphotyrosine, anti-green fluorescent protein
(GFP) or with rabbit sera against CED-6 are shown in
Figure 30, blots (A), (B) and (C). One band between
49 and 74K stained with anti-phosphotyrosine is
present in the COS-1 cell lysates transfected with
fusion proteins of GFP and CED-6 and is not present
in the control COS-1 cell lysates. Probing the
western blot with anti-GFP and anti-CED-6 stained the
same band between 49 and 74K.

Conclusion

Fusion proteins CED-6-GFP and GFP-CED-6 are both
tyrosine phosphorylated. Their molecular weight is
62385.95 Da, which corresponds to the band between 49
and 74K that stains positive for
anti-phosphotyrosine, anti-green fluorescent protein
(GFP) and anti-CED-6.

EXAMPLE 9

Stable cell lines transfected with human cell surface
receptor CD36

The J774 murine monocyte tumour cell line was
transfected by electroporation with plasmid pGA1058
shown in Figures 31 and 32. The methods used were as
described in Example 4 for human ced-6.

The transfected cell-line was used as a positive
control in carrying out phagocytosis assays using the
protocol of Example 5.

EXAMPLE 10

Generation of apoptotic particles starting from PC12

The PC-12 cell-line (ATCC number: CRL-1721) (Mesner
P.W., Winters T.R., Green S.H. (1992) J. Cell
Biol. 119:1669-1680) tends to grow in small clusters.
By addition of nerve growth factor-beta (50 ng/ml
final conc., Sigma), PC-12 cells differentiate into
neuronal cells. By withdrawal of nerve growth factor
after 5 days of treatment, programmed cell death in
neuronal rat PC12 cells is induced.

The cells are cultured in RPMI 1640 (Life
Technologies) with 2 mM L-glutamine adjusted to
contain 1.5 g/L sodium bicarbonate, 4.5 g/L glucose,
10 mM HEPES and 1 mM sodium pyruvate, 10% horse serum,
5% fetal bovine serum.

These cells can be tested for apoptotic character
using the annexin/PI kit described above.

SEQUENCE LISTING

The nucleotide and amino acid sequences shown in the
Figures herein are designated the following SEQ ID
Nos.:

SEQ ID NO: 1     Figure 1
SEQ ID NO: 2     Figure 2
SEQ ID NO: 3     Figure 3
SEQ ID NO: 4     Figure 4
SEQ ID NO: 5     Figure 5
SEQ ID NO: 6     Figure 7
SEQ ID NO: 7     Figure 9
SEQ ID NO: 8     Figure 11
SEQ ID NO: 9     Figure 17
SEQ ID NO: 10    Figure 19
SEQ ID NO: 11    Figure 26
SEQ ID NO: 12    Figure 28
SEQ ID NO: 13    Figure 31

CLAIMS:

1. An expression vector comprising a sequence
of deoxynucleotides encoding a human CED-6 protein
comprising the amino acid sequence of Figure 4 or
Figure 5 or an amino acid sequence which differs from
the amino acid sequence of Figure 4 or Figure 5 only
in amino acid changes which are conservative of
function.

2. An expression vector as claimed in claim 1
comprising the sequence of deoxynucleotides shown
from the transcription start codon to the
transcription stop codon in Figure 2 or Figure 3.

3. An expression vector as claimed in claim 1
or claim 2 which comprises a sequence of
deoxynucleotides encoding a reporter gene positioned
in said vector such that expression of said human
CED-6 protein or functionally conserved variant
thereof results in expression of a reporter protein
from said reporter gene.

4. An expression vector as claimed in claim 3
wherein said reporter gene is positioned 5' to the
sequence of deoxynucleotides encoding said human
CED-6 protein or functionally conserved variant thereof.

5. An expression vector as claimed in claim 3
wherein said reporter gene is positioned 3' to the
sequence of deoxynucleotides encoding said human
CED-6 protein or a functionally conserved variant
thereof.

6. An expression vector as claimed in any of
claims 3 to 5 wherein said reporter gene encodes
green fluorescent protein (GFP).

7. An expression vector as claimed in claim 4
which is pEGFP-C2 with a sequence of deoxynucleotides
encoding a human CED-6 protein which comprises the
sequence of amino acids as shown in Figure 4 or
Figure 5 or a functionally conserved variant thereof
inserted in the multicloning site.

8. An expression vector as claimed in claim 5
which is pEGFP-N3 with a sequence of deoxynucleotides
encoding a human CED-6 homologue which comprises the
sequence of amino acids as shown in Figure 4 or
Figure 5 or a functionally conserved variant thereof
inserted in the multicloning site.

9. An expression vector as claimed in claim 4
or 7 wherein said vector comprises the nucleotide
sequence of Figure 28 (pGA3103).

10. An expression vector as claimed in claim 5
or 8 wherein said vector comprises the nucleotide
sequence of Figure 9 (pGA3104).

11. An expression vector as claimed in claim 1
or claim 2 wherein the human CED-6 protein or
functionally conserved variant thereof expressed from
said vector includes an epitope tag at the amino
and/or the carboxy terminus thereof.

12. An expression vector as claimed in claim 11
wherein said epitope tag is HisA.

13. An expression vector as claimed in claim 11
which is pBAD/HisA with a sequence of
deoxynucleotides encoding a human CED-6 protein
comprising the sequence of amino acids as shown in
Figure 4 or Figure 5, or a functionally conserved
variant thereof, inserted therein.

14. An expression vector as claimed in claim 12
which has the sequence of deoxynucleotides shown in
Figure 17 (pGA1028).

15. A mammalian cell-line transfected with an
expression vector as claimed in any one of claims 1
to 13.

16. A mammalian cell-line as claimed in claim
15 wherein said cell is selected from a fibroblast
cell-line or an epithelial cell line.

17. A mammalian cell-line as claimed in claim
16 wherein said cell-line is selected from COS1,
BHK21, L929, CV1, SWISS 3T3, HT144, IMR32, HEPG2,
MDCK, MCF7, 293, Hela, A549, SW48 or G361.

18. A mammalian cell-line as claimed in claim
15 wherein said cell-line is a primary cell-line.

19. A mammalian cell-line as claimed in claim
18 wherein said cell-line is selected from human
dermal FIBs, dermal keratinocytes, leucocytes,
monocytes, lymphocytes, dendritic cells or
macrophages.

20. A mammalian cell-line as claimed in claim
19 which is mouse macrophage cell-line J774 or human
monocyte cell-line THP-1.

21. A method for determining whether a compound
is an inhibitor or an enhancer of a signal
transduction pathway which promotes phagocytosis of
apoptotic cells which method comprises exposing
transfected mammalian cells as claimed in any one of
claims 15 to 20 to apoptotic particles and measuring
the rate of phagocytic uptake of said particles by
said transfected cells in the presence and absence of
said compound.

22. A method as claimed in claim 21 wherein
said transfected cells are exposed to said compound
prior to addition of said apoptotic particles.

23. A method as claimed in claim 21 or 22
wherein said apoptotic particles are selected from
the group consisting of apoptotic neutrophils,
apoptotic lymphocytes and apoptotic erythrocytes
which optionally have been opsonised.

24. A method as claimed in any of claims 21 to
23 wherein said apoptotic particles comprise adherent
cell-line PC12.

25. A method as claimed in any of claims 21 to
23 wherein said apoptotic particles comprise the
growth factor dependent mouse cell-line Ba/F3.

26. A method as claimed in claim 25 wherein
said Ba/F3 cells are rendered apoptotic by culturing
in the absence of growth factor IL-3.



27. A method as claimed in any one of claims 23
to 26 wherein the cells comprising said apoptotic
particles are stably transfected with a reporter
gene.

28. A method as claimed in claim 27 wherein
said reporter gene is selected from a gene encoding
β-galactosidase, a gene encoding a fluorescent
protein or a gene encoding a protein capable of
generating luminescence.

29. A method as claimed in claim 28 wherein
said protein capable of generating luminescence is
luciferase.

30. A method as claimed in claim 28 wherein
said fluorescent protein is green fluorescent
protein.

31. A method as claimed in claim 26 wherein
said apoptotic particles comprise Ba/F3 cells stably
transfected with β-galactosidase or luciferase.

32. A method as claimed in claim 31 wherein the
level of phagocytosis is detected by adding a
substrate which is converted by said β-galactosidase
to a fluorescent compound.

33. A method as claimed in any one of claims 21
to 32 wherein if no phagocytosis or a reduced amount
of phagocytosis is observed on exposure to the test
compound, said mammalian transfected cells are
examined for viability.

34. A method as claimed in claim 33 wherein, if
viable, the phenotype of said mammalian transfected
cells is compared with the phenotype of untransfected
mammalian cells of the same cell-line.

35. A method as claimed in any of claims 21 to
32 wherein, if an increased amount of phagocytosis is
observed in the presence of the test compound, the
method comprises the further steps of exposing said
compound to an untransfected mammalian cell of the
same cell-line and observing whether the compound
induces a phenotype which is substantially the same
as the phenotype exhibited by said transfected
mammalian cell.

36. A compound identified by the method of any
of claims 21 to 35 as an inhibitor or an enhancer of
a signal transduction pathway which promotes
phagocytosis of apoptotic cells.

37. A method for determining whether a compound
is an inhibitor or an enhancer of a signal
transduction pathway which promotes phagocytosis of
apoptotic cells which method comprises the steps of:

(1) micro-injecting into a mammalian cell a
    human CED-6 protein comprising the sequence
    of amino acids shown in Figure 4 or Figure
    5 or a sequence of amino acids differing
    from that shown in Figure 4 or Figure 5
    only in amino acid changes conservative of
    function;

(2) exposing the mammalian cell produced in
    step (1) to apoptotic particles and
    measuring the rate of phagocytic uptake of
    said particles by said cell in the presence
    and absence of said compound.

38. A method as claimed in claim 37 wherein
said micro-injected mammalian cells are exposed to
said compound prior to addition of said apoptotic
particles.

39. A method as claimed in claim 38 wherein
said apoptotic particles are selected from apoptotic
neutrophils, apoptotic lymphocytes and apoptotic
erythrocytes which optionally have been opsonized.

40. A method as claimed in any of claims 37 or
38 wherein said apoptotic particles comprise adherent
cell-line PC12.

41. A method as claimed in any of claims 37 to
39 wherein said apoptotic particles comprise the
growth factor dependent mouse cell-line Ba/F3.

42. A method as claimed in claim 41 wherein
said Ba/F3 cells are rendered apoptotic by culturing
in the absence of growth factor IL-3.

43. A method as claimed in any one of claims 39
to 42 wherein the cells comprising said apoptotic
particles are stably transfected with a reporter
gene.

44. A method as claimed in claim 27 wherein
said reporter gene is selected from a gene encoding
β-galactosidase, a gene encoding a fluorescent
protein and a gene encoding a protein capable of
generating luminescence.

45. A method as claimed in claim 44 wherein
said protein capable of generating luminescence is
luciferase.

46. A method as claimed in claim 44 wherein
said fluorescent protein is green fluorescent
protein.

47. A method as claimed in claim 42 wherein
said apoptotic particles comprise Ba/F3 cells stably
transfected with β-galactosidase.

48. A method as claimed in claim 47 wherein the
level of phagocytosis is detected by adding a
substrate which is converted by said β-galactosidase
to a fluorescent compound.

49. A method as claimed in any of claims 37 to
48 wherein the mammalian cell is a fibroblast cell or
an epithelial cell.

50. A method as claimed in claim 49 wherein the
mammalian cell is selected from COS1, BHK21, L929,
CV1, SWISS 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, 293,
Hela, A549, SW48 or G361.

51. A method as claimed in any of claims 37 to
48 wherein said mammalian cell is a primary cell.

52. A method as claimed in claim 51 wherein
said mammalian cell is selected from human dermal
FIBs, dermal keratinocytes, leucocytes, monocytes or
macrophages.

53. A method as claimed in claim 52 wherein
said mammalian cell is a mouse macrophage cell J774
or a human monocyte cell THP-1.

54. A method as claimed in any one of claims 37
to 53 wherein if no phagocytosis or a reduced amount
of phagocytosis is observed on exposure to the test
compound, said mammalian transfected cells are
examined for viability.

55. A method as claimed in claim 54 wherein, if
viable, the phenotype of said mammalian transfected
cells is compared with the phenotype of untransfected
mammalian cells of the same cell-line.

56. A method as claimed in any of claims 21 to
32 wherein if an increased amount of phagocytosis is
observed in the presence of the test compound, the
method comprises the further steps of exposing said
compound to an untransfected mammalian cell of the
same cell-line and observing whether the compound
induces a phenotype which is substantially the same
as the phenotype exhibited by said transfected
mammalian cell.

57. A compound identified by the method of any
of claims 37 to 56 as an inhibitor or an enhancer of
a signal transduction pathway which promotes
phagocytosis of apoptotic cells.

58. A method for determining whether a compound
is an inhibitor or an enhancer of a signal
transduction pathway which promotes phagocytosis of
apoptotic cells which method comprises the steps of:

(1) micro-injecting or transfecting into a
    mammalian cell a vector expressing RNA
    antisense to all or a portion of the
    sequence of nucleotides shown in Figure 2
    or Figure 3;

(2) exposing the mammalian cell produced in
    step (1) to apoptotic particles and
    measuring the rate of phagocytic uptake of
    said particles by said cell in the presence
    or absence of said compound.

59. A method as claimed in claim 58 wherein
said antisense RNA comprises a sequence of
nucleotides which is capable of hybridizing to a
sequence of nucleotides as shown in Figure 2 or
Figure 3 under conditions of stringency which are
higher than 2xSSC; 0.1% SDS; 25°C to 50°C.

60. A method as claimed in claim 58 or claim 59
comprising the features of any one of claims 38 to
56.

61. A compound identified by the method of any of
claims 58 to 60 as an inhibitor or an enhancer of a
signal transduction pathway which promotes
phagocytosis of apoptotic cells.

62. A peptide which comprises a fragment of a
human CED-6 homologue having an amino acid sequence as
shown in Figure 4 wherein said fragment includes the
sequence of amino acids NRAFSRKKDKTC, FLGSTEVEQPKGTE
or TRNGTQPPPVPSRST.

63. A peptide as claimed in claim 62 consisting
of the sequence of amino acids NRAFSRKKDKTC,
FLGSTEVEQPKGTE or TRNGTQPPPVPSRST.

64. An antibody preparation comprising
antibodies directed to one or more of the following
epitopes of human CED-6 homologue as shown in Figure
4: NRAFSRKKDKTC, FLGSTEVEQPKGTE or TRNGTQPPPVPSRST.

65. An antibody preparation comprising
antibodies directed to the human CED-6 homologue
epitope NRAFSRKKDKTC.

66. An antibody preparation comprising
antibodies directed to the human CED-6 homologue
epitope FLGSTEVEQPKGTE.

67. An antibody preparation comprising
antibodies directed to the human CED-6 homologue
epitope TRNGTQPPPVPSRST.

68. An antibody preparation as claimed in any
one of claims 63 to 66 wherein said antibodies are
polyclonal antibodies.

69. A method for diagnosing a disease
associated with the over or under expression of human
CED-6 protein in phagocytic cells in an individual
which comprises:

(a) obtaining a sample of phagocytes from said
    individual;

(b) exposing said phagocytes to an antibody
    preparation as claimed in any of claims 64
    to 68;

(c) quantitatively measuring the presence of
    any immune complexes formed between said
    antibodies and said CED-6 protein; and

(d) comparing the amount of immune complex
    formed with that formed using phagocytes
    from a control individual.

70. A method for determining whether a compound
is an inhibitor or an enhancer of a signal
transduction pathway which promotes phagocytosis of
apoptotic cells which method comprises:

(a) exposing a mammalian cell transfected with
    an expression vector as claimed in any one
    of claims 1 to 14 to the compound to be
    tested;

(b) exposing said mammalian cell to an antibody
    preparation as claimed in any of claims 64
    to 68;

(c) quantitatively measuring the presence of
    any immune complex formed between said
    antibodies and protein expressed by said
    cells; and

(d) comparing the level of immune complex
    detected with the amount of immune complex
    detected in a mammalian cell transfected as
    described in step (a) which has not been
    exposed to said compound.

71. A method as claimed in claim 70 wherein
said mammalian cell is selected from COS1,
BHK21, L929, CV1, SWISS 3T3, HT144, IMR32,
HEPG2, MDCK, MCF7, 293, Hela, A549, SW48 or
G361.

72. A method as claimed in claim 71 wherein
said mammalian cell is a COS1 cell.

73. A method as claimed in claim 70 wherein the
mammalian cell is a human dermal FIB, dermal
keratinocyte, leucocyte, monocyte or macrophage.

74. A method as claimed in claim 73 wherein
said cell is mouse monocyte cell J774 or human
monocyte cell THP-1.

75. A fusion protein which comprises:

(1) a sequence of amino acids as shown in
    Figure 4 or Figure 5 or a sequence of amino
    acids which differs from the sequence shown
    in Figure 4 or Figure 5 only in amino acid
    changes conservative of function; and

(2) a protein which is the expression product
    of a reporter gene.

76. A fusion protein as claimed in claim 75
which is obtained by expression of the GFP and
hlced-6 encoding sequences shown in Figures 9 or 28.

77. A fusion protein which comprises:

(1) a sequence of amino acids as shown in
    Figure 4 or Figure 5 or a sequence of amino
    acids which differs from the sequence shown
    in Figure 4 or Figure 5 only in amino acid
    changes conservative of function, and

(2) an epitope tag.

78. A fusion protein as claimed in claim 77
which is obtainable by expression of the HisA and
hlced-6 encoding sequences shown in Figure 17.

79. A method of identifying a compound which is
an enhancer or an inhibitor of phagocytosis of
apoptotic cells which comprises:

a) exposing a mammalian professional or
   semi-professional phagocyte to an apoptotic
   mammalian cell which has been stably
   transfected with a reporter gene capable of
   generating a signal detectable without
   microscopy, in the presence and absence of
   the compound to be tested,

b) removing any apoptotic cells which are not
   engulfed by said phagocytes and

c) detecting any signal of the reporter gene
   from said phagocytes;

wherein any difference in signal in the presence
of said compound compared to the signal in the
absence of said compound is an indication that said
compound is an inhibitor or an enhancer of
phagocytosis of apoptotic cells.

80. A method as claimed in claim 79 wherein
said phagocyte is mouse macrophage cell-line J774 or
human monocyte cell-line THP-1.

81. A method as claimed in claim 80 wherein the
monocyte cell-line is cultured under conditions to
differentiate it into macrophages prior to exposure
to said apoptotic particles.

82. A method as claimed in any of claims 79 to
81 wherein said phagocyte is a transgenic cell.

83. A method as claimed in claim 82 wherein
said phagocyte has been transfected with an
expression vector as claimed in any of claims 1 to
14.

84. A method as claimed in claim 82 wherein
said phagocyte has been transfected with an
expression vector encoding the cell surface receptor
CD36.

85. A method as claimed in claim 84 wherein
said phagocyte has been transfected with a vector as
shown in Figure 31.

86. A method as claimed in any of claims 79 to
85 wherein said apoptotic cells comprise the adherent
cell-line PC12.

87. A method as claimed in any one of claims 79
to 85 wherein said apoptotic cells comprise the
growth factor dependent mouse cell-line Ba/F3.

88. A method as claimed in claim 26 wherein
said Ba/F3 cells are rendered apoptotic by culturing
in the absence of the growth factor IL-3.

89. A method as claimed in claim 88 wherein
said cells are considered apoptotic if about 20%
annexin positive and less than about 5% propidium
iodide negative.

90. A method as claimed in any of claims 79 to
89 wherein said reporter gene is selected from a gene
encoding β-galactosidase, a gene encoding a
fluorescent protein or a gene encoding a protein
capable of generating luminescence.

91. A method as claimed in claim 90 wherein
said protein capable of generating luminescence is
luciferase.

92. A method as claimed in claim 91 wherein
said apoptotic cell has been stably transfected with
a plasmid exhibiting the expression characteristics
of pGL2 control shown in Figure 19.

93. A method as claimed in claim 92 wherein
said apoptotic cell has been stably transfected with
a plasmid comprising the sequence of deoxynucleotides
shown in Figure 19.

94. A method as claimed in any one of claims 79
to 89 wherein fluorescent protein is green
fluorescent protein (GFP).

95. A method as claimed in claim 94 wherein
said apoptotic cell has been stably transfected with
a plasmid exhibiting the expression characteristics
or a plasmid as shown in Figure 10 or Figure 29.

96. A method as claimed in claim 95 wherein
said apoptotic cell has been transfected with a
plasmid comprising the sequence of nucleotides shown
in Figure 9 or Figure 28.

97. A method as claimed in any of claims 79 to 
89 wherein said apoptotic cells have been stably 

5 transfected with a plasmid expressing b- 
galactosidase . 

98. A method as claimed in claim 97 wherein
said plasmid has the expression characteristics of
the plasmid shown in Figure 11.

99. A method as claimed in claim 98 wherein
said plasmid comprises the sequence of
deoxynucleotides shown in Figure 11.

100. A method as claimed in any of claims 79 to
89 wherein said apoptotic particles comprise cell
line Ba/F3 stably transfected with β-galactosidase.

101. A method as claimed in claim 100 wherein
the level of phagocytosis is detected by adding a
substrate, which is converted by said β-galactosidase
to a fluorescent compound.

102. A method as claimed in any one of claims
79 to 101 wherein if no phagocytosis or a reduced
amount of phagocytosis is observed on exposure to the
test compound, said phagocytes are tested for
viability.

103. A method as claimed in any of claims 79 to
102 wherein said phagocytes are cultured in multiwell
plates, the apoptotic cells and the test compounds
being added to the individual wells thereof.



104. A method as claimed in any preceding claim
wherein the signal from said reporter gene is
detected by an automatic plate reader.

105. A method as claimed in claim 101 wherein
the signal from the reporter gene is detected by an
automatic plate reader capable of detecting a
fluorescent signal.

106. A compound identified as an inhibitor or
enhancer of phagocytosis of apoptotic cells by the
method of any of claims 79 to 105.
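
The screening logic of claims 102 to 106 can be pictured with a minimal sketch. It assumes that each well of a multiwell plate yields one reporter signal, that untreated control wells define the baseline, and that a compound is flagged when its signal departs from that baseline by more than a chosen fraction. The function name `classify_wells` and the `tolerance` cut-off are hypothetical; the claims do not specify any threshold or data layout.

```python
from statistics import mean


def classify_wells(control_signals, test_signals, tolerance=0.2):
    """Label each test well relative to the mean signal of control wells.

    `tolerance` is a hypothetical cut-off (fraction of the control mean);
    it is an assumption of this sketch, not a value taken from the claims.
    """
    baseline = mean(control_signals)
    if baseline <= 0:
        raise ValueError("control signal must be positive")

    labels = {}
    for well, signal in test_signals.items():
        ratio = signal / baseline
        if ratio < 1 - tolerance:
            # Claim 102: reduced or absent uptake calls for a viability
            # test before the compound is treated as an inhibitor.
            labels[well] = "reduced uptake: check phagocyte viability"
        elif ratio > 1 + tolerance:
            labels[well] = "enhanced uptake"
        else:
            labels[well] = "no clear effect"
    return labels


if __name__ == "__main__":
    controls = [100.0, 100.0]
    tests = {"A1": 50.0, "A2": 150.0, "A3": 105.0}
    for well, label in classify_wells(controls, tests).items():
        print(well, label)
```

A well flagged as reduced uptake is only a candidate inhibitor until the viability test of claim 102 rules out a toxic effect on the phagocytes.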

107. A method as claimed in any of claims 22 to
35, 39 to 56, and 58 to 60 wherein phagocytic uptake
is measured by non-microscopic means.

108. A method as claimed in claim 107 wherein
said non-microscopic means is a multi-well plate
reader.

109. A method as claimed in claim 108 wherein
said multi-well plate reader measures luminescence,
fluorescence or performs spectrophotometric
detection.

110. A method as claimed in any of claims 79 to
105 having the features of claims 108 and 109.

111. A method for diagnosing a disease
associated with the over- or under-expression of
human CED-6 in phagocytic cells in an individual,
which method comprises:

(a) obtaining a sample of phagocytes from said
individual,

(b) isolating RNA from said phagocytes,

(c) preparing cDNA from said RNA,

(d) performing a first PCR reaction on said
cDNA,

(e) performing a second (nested) PCR on the
reaction product of said first PCR
reaction,

(f) quantitatively and qualitatively measuring
the presence of CED-6 RNA by analysing the
reaction products from the first and second
PCR,

(g) comparing the amount and type of reaction
products formed in the first and second PCR
with that of the reaction products formed
using phagocytes from control individuals.

112. A method as claimed in claim 111 wherein
said PCR is performed with primers derived from the
sequence of human CED-6, or derived from the vector
used in the generation of cDNA.

113. A method as claimed in claim 111 or 112
wherein said first PCR is performed with primers
having nucleotide sequences:

1) cgcaaggatccccatgaaccgtgcttttagcaggaag

2) gatctactaggtactggag

114. A method as claimed in any of claims 111,
112 or 113 wherein said second PCR is performed with
primers having nucleotide sequences:

1) cgcaaggatccccatgaaccgtgcttttagcaggaag

2) gcggatggtaccgtcgactgctgatacttgagttattctcag
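
The relationship between the primers of claims 113 and 114 and the hCED-6 cDNA can be checked with a short sketch. It assumes that only the 3' portion of each primer anneals to the template and that the 5' portion is a non-templated tail. The three template excerpts are short sense-strand stretches transcribed by hand from the hCED-6 sequence given in this document and are illustrative only; the helper names `reverse_complement` and `longest_3prime_match` are hypothetical.

```python
# Primer sequences copied from claims 113 and 114.
FORWARD = "cgcaaggatccccatgaaccgtgcttttagcaggaag"
REVERSE_FIRST = "gatctactaggtactggag"
REVERSE_NESTED = "gcggatggtaccgtcgactgctgatacttgagttattctcag"

# Assumed sense-strand excerpts of the hCED-6 cDNA around each annealing
# site (hand-transcribed, illustrative only).
FORWARD_SITE = "gatcctcatcatgaaccgtgcttttagcaggaagaaagac"
NESTED_SITE = "ccaactgagaataactcaagtatcagcacctccagc"
FIRST_SITE = "cacctccagtacctagtagatctactga"

COMPLEMENT = {"a": "t", "c": "g", "g": "c", "t": "a"}


def reverse_complement(seq):
    """Return the reverse complement of a lower- or upper-case DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.lower()))


def longest_3prime_match(primer, strand):
    """Length of the longest 3'-terminal part of `primer` found in `strand`."""
    primer, strand = primer.lower(), strand.lower()
    for n in range(len(primer), 0, -1):
        if primer[-n:] in strand:
            return n
    return 0


if __name__ == "__main__":
    # The forward primer reads along the sense strand; the reverse primers
    # read along the antisense strand, hence the reverse complement.
    print(longest_3prime_match(FORWARD, FORWARD_SITE))
    print(longest_3prime_match(REVERSE_FIRST, reverse_complement(FIRST_SITE)))
    print(longest_3prime_match(REVERSE_NESTED, reverse_complement(NESTED_SITE)))
```

For these excerpts the three calls report 25, 19 and 24 matching 3'-terminal bases. The unmatched 5' tails contain the hexamers ggatcc (forward primer) and ggtacc and gtcgac (nested reverse primer), which are the recognition sequences of BamHI, KpnI and SalI; that these tails were intended as cloning sites is an inference, not something the claims state.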

30 
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consensus 
Seq 

thcll7484 
r65982 
aal59394 
aa369714 



GGTGATGAGC 
GGTGATGAGC 
. . TGATGAGC 



CCTTGGGTTC 
CCTTGGGTTC 
CCTTGGGTTC 



TCGCTCCGAC 
TCGCTCCGAC 
TCGCTCCGAC 



GGTGATGAGC 
. . TGATGAGC 



CCTTGGGTTC 
CCTTGGGTTC 



TCGCTCCGAC 
TCGCTCCGAC 



TGCTAAATTC 
TGCTAAATTC 
TGCTAAATTC 
. GCTAAATTC 
TGCTAAATTC 
TGCTAAATTC 



50 

GCTTGGCCGG 
GCTTGGCCGG 
GCTTGGCCGG 
GCTTGGCCGG 
GCTTGGCCGG 
GCTTGGCCGG 



consensus 
Seq 

thcll7484 
. ^r65982 
aa 15 93 9 4 
aaik'9714 



51 

GTCCACCTTC 
GTCCACCTTC 
GTCCACCTTC 
GTCCACCTTC 
GTCCACCTTC 
GTCCACCTTC 



TCGTGGCCTC 
TCGTGGCCTC 
TCGTGGCCTC 
TCGTGGCCTC 
TCGTGGCCTC 
TCGTGGCCTC 



ACTCGCCACA 
ACTCGCCACA 
ACTCGCCACA 
ACTCGCCACA 
ACTCGCCACA 
ACTCGCCACA 



CGGATCAGAA 
CGGATCAGAA 
CGGATCAGAA 
CGGATCAGAA 
CGGATCAGAA 
CGGATCAGAA 



100 

TCCGGAGCAG 
TCCGGAGCAG 
TCCGGAGCAG 
TCCGGAGCAG 
TCCGGAGCAG 
TCCGGAGCAG 



consensus 
... .-.Seq 
thc'117484 
r65982 
aal5 93 94 
aa36.9714 
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GCAGTTCTCT 
GCAGTTCTCT 
GCAGTTCTCT 
GCAGTTCTCT 
GCAGTTCTCT 
GCAGTTCTNT 



CTATTCTGAG 
CTATTCTGAG 
CTATTCTGAG 
CTATTCTGAG 
CTATTCTGAG 
CTATTCTGAG 



GCTCCTGCGG 
GCTCCTGCGG 
GCTCCTGCGG 
GCTCCTGCGG 
GCTCCTGCGG 
GCTCCTNCGG 



C TGCC GCG 
C TGCC GCG 
C . TGCCGGC . 
C . TGCCGGC . 
CNTGC CNGCG 
C . TGCCGCGC 



ISO 

CTGACTTCCC 
CTGACTTCCC 

TGACTTCCC . 

TGACTTCCC . 

TGACTTCCCG 

TGACTTCCC . 



consensus 
Seq 

thcll7484 
.-.-> : r6598 2 
aal5 93 94 
aa369714 



151 

TGTGTGGNGG 
TGTGTGCGGG 
TGTGTGCGGG 
TGTGTGCGGG 
TGTGTGGNGG 
TGTGTGCGGG 



AGGGAACTCT 
AGGGAACTCT 
AGGGAACTCT 
AGGGAACTCT 
AGGGAACTCT 
AGGGAACTCT 



GGGCAGGCTG 
GGGCAGGCTG 
GGGCAGGCTG. 
GGGCAGGCTG 
GGGCAGGCTG 
GGGCAGGCTG 



GTTTTCTTGG 
GTTTTCTTGG 
GTTTTCTTGG- 
GTTTTCTTGG 
GTTTTCTTGG 
GTTTTNTTGG 



200 

AATGTGTTTA 
AATGTGTTTA 
AATGTGTTTA 
.AATGTGTTTA 
AATGTGTTTA 
AATGTGTTTA 



consensus 



' " . Seq 
r65983rcc 
■the 117484 
; r65982 

aal59394 

aa3*69714 



consensus 
; : Seq 
r65983rcc 
the 1174 84 
' r'65982 
aal59394 



201 

CGAT . GTTGA 



ATGGGACTTG AACAGG . . AA 



250 

GCTGGACGCT GCA , GCTGGA 



primer OGA103 



primer OGA104 

CGAT GTTGA ATGGGACTTG AACAGG AA 

..... . CTTG AAACGGGNAA 

ATGGGACTTG AACAGG . . AA 
ATGGGACTTG AACAGG . . AA 
ATGGGACTTG AACAGG. .AA 
ATGGGACTTG AACAGG . . AA 



CGAT . GTTGA 
CGAT . GTTGA 
CGAT . GTTGA 
CGATTGTTGA 



GCTGGACGCT 
CCGGGCCNCT 
GCTGGACGCT 
GCTGGACGCT 
GCTGGACGCT 
GCTGGACGCT 



251 

ACTAGCGTGC 
ACTAGCGTGC 
ACTACCGTGC 
ACTAGCGTGC 
ACTAGCGTGC 
ACTAGCGTGC 



C . AAGTT ATT 
C AAGTT ATT 
CCAAGTTATT 
C . AAGTTATT 
C . AAGTTATT 
C. AAGTTATT 



TATGATTCC . 
TATGATTCC 
TATGANCCCC 
TATGATTCC . 
TATGATTCC . 
TATGATTCC . 



ATCTGATATA 
ATCTGATATA 
ACCTGATATA 
ATCTGATATA 
ATCTGATATA 
ATCTGNTATA 



GCA GCTGGA 
GCAAGCNGGA 
GCA . GCTGGA 
GCA . GCTGGA 
GCA . GCTGGA 
GCA. 

- 300 
CATAGGAGAG 
CATAGGAGAG 
CATGGGAGAG 
CATAGGAGAG 
CATAGGAGAG 
CATAGGAGAG 



consensus 



Seq 
OGA102 
r65983rcc 
thcll7484 
r65982 
aal593'94 



301 350 
AAACT GATA GAAGAATTCT GATGGCAACT GTATGATAG AA GCTAT AT 

primer 44 5 - 10934 -02F 
primer oGA102 

AAACT GATA GAAGAATTCT GATGGCAACT GTATGATAG AAGCTAT AT 

.* TA 

AAACT. GATA GAAGAATTCT GATGGCAACT .GTATGATAG AAGCTAT. AT 
AAACT. GATA GAAGAATTCT GATGGCAACT .GTATGATAG AAGCTAT. AT 
AAACT. GATA GAAGAATTCT GATGGCAACT .GTATGATAG AAGCTAT. AT 
AAACTTGATA GAAGAATTCT GATGGCAACT .GTATGATAG AAGCTAT. AT 

SUBSTITUTE SHEET (RULE 26) 



WO 99/64586 



PCT/EP99/04043 





hCED-6 



consensus 
Sea 
OGA102 
r65983rcc 
thcll7484 
r65982 
aal59394 



consensus 

Seq 
OGA102 
r65983rcc 
thcll7484 
T65982 
aal59394 



351 

AAAGTCAAGT 



GTCCATTTTC 



AAAGTCAAGT 
CAAGTCA . GT 
AAAGTCAAGT 
AAAGTCAAGT 
AAAGTCAAGT 
AAAGTCAAGT 



GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 



TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 



ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 



400 

ACCCAGGATT 
ACCCAGGATT 
ACCCAGGATT 
ACCCAGGATT 
ACCCAGGGTT 
ACCCAGGGTT 
ACCCAGGATT 



M 



401 

TAAGTCGTGG 

TAAGTCGTGG 
TAAGTCGTGG 
TAAGTCGTGG 
TAAGTCGTGG 
TAAGTCGTGG 
TAAGTCGTGG 



AACTGAACAT 

AACTGAACAT 
AACTGAACAT 
AACTGAACAT 
AACTGAACAT 
AACTGAACAT 
AACTGAACAT 



N R 
450 

TTATTTGG CT GATCCTCATC ATG . AACC GT 

Primer 445-10934-07-R 
TTATTTGGCT GATCCTCATC ATG AACCGT 
TTATTTGGCT GATCCTCATC ATG . AACCGT 
TTATTTGGCT GATCCTCATC ATG . AACCGT 
TTATTTGGCT GATCCTCATC ATGG AACCGT 
TTATTTGGCT GATCCTCATC ATGGAACCGT 
TAT 



CED-6 

CED - 6 
hCED-6 



MAKDIYKTFK RSVSGIVGGN NINGEGSSSP STSAPQVKYR GGTG ... 



consensus 
Seq 

oGA102 
r65983rcc 
thc!17484 

r65982 



A F S 
451 

GCTTTTAGCA 
GCTTTTAGCA 
GCTTTTAGCA. 
GCTTTTAGCA 
GCTTTTAGCA 
GCTTTTAGCA 



R K K D 

GGAAGAAAGA 
GGAAGAAAGA 
GGAAGAAAGA 
GGAAGAAAGA 
GGAAGAAAGA 
GGAAGAAAGA 



R 
K 



T 
T 



W 

w 



CAAAACATGG 
CAAAACATGG 
CAAAACATGG 
CAAAACATGG 
CAAAACATGG 
CAAAACATGG 



I H P 
M H T 

ATG CAT AC AC 
ATG CAT AC AC 
ATGCATACAC 
ATGCATACAC 
GTGCTNACAC 
GTGCTNACAC 



P D Y L 
PEAL 
500 

CTGAAGCTTT 
CTGAAGCTTT 
CTGAAGCTTT 
CTGAAGCTTT 
CTGAAG . NTT 
CTGAAG . NTT 



CED - 6 
hCED-6 



consensus 
Seq 

OGA102 
r65983rcc 
thcll7484 

r65982 



I N G 

S K H 
501 

ATCAAAACAT 
ATCAAAACAT 
ATCAAAACAT 
ATCAAAACAT 
ATCAAAAC.N 
ATCAAAAC . N 



HVE YVA R FLG 

FIP YNAK FLG 

TTCATTCCCT ATAATGCAAA GTTTCTTGGC 
TTCATTCCCT ATAATGCAAA GTTTCTTGGC 
TTCATTCCCT ATAATGCAAA GTTTCTTGGC 
TTCATTCCCT ATAATGCAAA GTTTCTTGGC 

TTCTTTCCNA TTT . . , . 

TTCTTTCCNA TTT. 



C V E 

.S T E 

550 

AGTACAGAAG 
AGTACAGAAG 
AGTACAGAAG 
AGTACAGAAG 



CED-6 
hCED-6 



consensus 
Seq 
OGA10 2 
r65983rcc 



T P K A 

V E Q P 
551 

TGGAACAGCC 
TGGAACAGCC 
TGGAACAGCC 
TGGAACAGCC 



N 

K 



G 

G 



S 
T 



D 

E 



V A 

V V 



AAAAGGAACA GAAGTTGTGA 
AAAAGGAACA GAAGTTGTGA 
AAAAGGAACA GAAGTTGTGA 
AAAAGGAACA GAAGTTGTGA 



R E A I 

R D A V 

GAGATGCTGT 
GAGATGCTGT 
GAGATGCTGT 
GAGATGCTGT 



H A I 

R K L 
600 

AAGGAAACTA 
AAGGAAACTA 
AAGGAAACTA 
AAGGAAACTA 



CED-6 
hCED-6 



consensus 
Seq 

OGA102 
r65983rcc 

r76373 



R F Q 

K . F A 
601 

AAGTTTGCAA 
AAGTTTGCAA 
AAGTTTGCAA 
AAGTTTGCAA 
. AGTTTGCAA 



R D h K 

R H I K 

GACATATCAA 
GACATATCAA 
GACATATCAA 
GACATNTCAA 
GACATATCAA 



R 

K 



S E 
S E 



GAAATCTGAA 
GAAATCTGAA 
GAAATCTGAA 
GAAATCTGAA 
GAAATCTGAA. 



QTRETAK 
G Q K 

GGCCAGAAAA 
GGCCAGAAAA 
GGCCAGAAAA 
GGCCAAAAAA 
GGCCAGAAAA 



L Q K V 

I P K V 
650 

TTCCTAAAGT 

TTCCTAAAGT 

TTCCTAAAGT 

AAAAAAAAG. 

TTCCTAAAGT 



SUBSTITUTE SHEET (RULE 26) 



WO 99/64586 PCT/EP99/04043 




CED-6 
hCED-6 



consensus 
Seq 
OGA102 
r76378 



E I R 

E L Q 
651 
GGAGTTGCAA 
GGAGTTGCAA 
GGAGTTGCAA 
GGAGTTGCAA 



* * 

I S 

I s 



* 

I 

I 



D 

Y 



N 

G 



V 
V 



I 

K 



* 

I A 

I L 



D 
E 



ATATCAATTT ATGGAGTAAA 
ATATCAATTT ATGGAGTAAA 
ATATCAATTT ATGGAGTAAA 
ATATCAATTT ATGGAGTAAA 



AATTCTAGAA 
AATTCTAGAA 
AATTCTAGAA 
AATTCTAGAA 



I K T 
P K T 

700 
CCCAAAACAA 
CCCAAAACAA 
CCCAAAACAA 
CCCAAAACAA 



CED-6 
hCED-6 



consensus 

■ Se< 3 
OGA102 

r76378 

d82787 



KAPM YTF PLG R I S . F CAD 

KEVQ HNC QLH RISF CAD 

701 750 

AGGAAGTTCA ACACAATTGC CAGCTTCATA GAATATCTTT TTGTGCAGAT 
AGG 



AGG ; 

AGGAAGTTCA ACACAATTGC CAGCTTCATA GAATATCTTT TTGTGCAGAT 
. CAATTGC CAGCTTCATAGNAATATCTTGGGNGTGCAGAT 



CED-6 
hCED-6 



consensus 
r76378 
d82787 



DKD D KRM F S F IARAEGAS 

DK T DKRI FTF ICK DSES 
7 51 8 0 0 

GATAAAACTG ACAAGAGGAT ATTCACTTTC ATATGCAAAG ATTCTGAGTC 
GATAAAACTG ACAAGAGGAT ATTCACTTTC ATATGCAAAG ATTCTGAGTC 
GATAAAACTG ACAAGAGGAT ATTCACTTTC ATATGCAAAG ATTCTGAGTC 



CED-6 
hCED-6 



consensus 
Seq 
. OGA102 
r76378 
aa307982 
d82787 



GKP S CY AFTS EKL AED 

NKH LCY VFDS E K C , A E E 
801 850 
AAATAAACAT TTGTGCTATG TATTTGACAG CGAAAAGTGT GCTGAAGAGA 

CTGAAGAGA 

CTGAAGAGA 

AAATAAACAT TTGTGCTATG TATTTGACAG CGAAAAGTGT GTAAGTATCC 

GT GCTGAAGAGA 

AAATAAACAT TTGTGCTATG TATTTGACAG CGMAAAAGTGTGCTGAAGAGA 



CED - 6 
hCED - 6 



consensus 
pGAlOl 
Seq 
OGA102. 
\, r76378 
aa307982 
d82787 



ITLT IGE AFD 
ITLT IGQ AFD 
851 

TCACTTTAAC AATTGGCCAA GCATTTGA . . 



TCACTTTAAC 
TCACTTTAAC 
CAGATGTTGT 
TCACTTTAAC 
TCACTTTAAC 



AATTGGCCAA GCATTTGA 
AATTGGCCAA GCATTTGA... - 
AGGGGTGGTT TGTTCTGTTT 
AATTGGCCAA GCATTTGA. . 
AATTGGCCAA GCATTTGNN 



LAY 

h A Y 

CCTGGCATAC 

CCTGGCATAC 
CCTGGCATAC 
TATAAGNCC 
CCTGGCATAC 
CTGGCATAC 



K R F 

R K F 

900 
AGGAAATTTC 
TC 

AGGAAATTTC 
AGGAAATTTC 
GGGGATTGTC 
AGGAAATTTC 
AGGAAATTTC 



CED-6 
hCED-6 



consensus 



pGAlOl 
Seq 
OGA102 
aa307982 
d82787 



L D K 

L E S 
901 

TAGAA . TCAG 



NRTS LE N Q K 
GGKD VE T RK 

GAGGAAAAGA TGTTGAAACA AGAAAA . 



Q I Y I 
Q I A G 
950 

AGATCGCAGG 



Primer OGA107-F 
Primer 445-10934-04 

Primer 445-10934 -08-R 



TAGAA 
TAGAA 
TAGAA 
TAGAA 
TNG7UV 



TCAG 
TCAG 
TCAG 
TCAG 
TCAG 



GAGGAAAAGA 
GAGGAAAAGA 
GAGGAAAAGA 
GAGGAAAAGA 
GAGGAAAAGA 



TGTTGAAACA AGAAAA C 
TGTTGAAACA AGAAAA C 
TGTTGAAACA AGAAAA. . .C 
TGTTGAAACA AGAAAA . . . C 
TGTTGAAACA AGAAAA . . . C 



AGATCGCAGG 
AGATCGCAGG 
AGATCGCAGG 
AGATCGCAGG 
AGATCGCAGG 



SUBSTITUTE SHEET (RULE 26) 



WO 99/64586 



PCT/EP99/04043 



V 




CED-6 
hCED-6 



consensus 
pGAlOl 
Seq 
OGA107 
OGA10 2 
T76378 
aa307982 
d82787 



L K K K I V 
L Q K R I Q 
951 

GTTACAAAAA AGAATCCAAG 
GTTACAAAAA AGAATCCAAG 
GTTACAAAAA AGAATCCAAG 

GTTACAAAAA AGAATCCAAG 

CTTG 

GTTACAAAAA AGAATCCAAG 
GTTACAAAAA AGACTCCANG 



ELET E N Q VLI 
DLET ENM ELK 

1000 

ACTTAGAAAC AG . AAAATAT GGAACTTAAA 
ACTTAGAAAC AG AAAATAT GGAACTTAAA 
ACTTAGAAAC AG AAAATAT GGAACTTAAA 

AAA 

ACTTAGAAAC AGGAAAATAT GGAACTTAAA 



ACTTAGAAAC AG . AAAATAT GGAACTTAAA 
ACTTAGAAAC AG AAAATAT GGT 



CED-6 
hCED-6 



consensus 
pGAlOl 
Seq 
OGA107 
OGA102 
aa307982 



I E R 
N K V 
1001 

AATAAAGTAC 
AATAAAGTAC 
AATAAAGTAC 
AATAAAGTAC 
AATAAAGTAC 
AATAAAGTAC 



L 
Q 



A E 
D L 



A 
E 



L 
N 



R A 
0 L 



N 
R 



S K 
I T 



A. AGATTTGG 
A AGATTTGG 
A AGATTTGG 
A AGATTTGG 
A . AGATTTGG 
A . AGATTTGG 



AAAACCAACT 
AAAACCAACT 
AAAACCAACT 
AAAACCAACT 
AAAACCAACT 
AAAACCAACT 



GAGAATAACT 
GAGAATAACT 
GAGAATAACT 
GAGAATAACT 
GANAATAACT 
GAGAATAACT 



A D Y 
Q V S 

1050 
CAAGTATCAG 
CAAGTATCAG 
CAAGTATCAG 
CAAGTATCAG 
CAAGTATCAG 
CAAGTATCAG 



CED-6 
hCED-6 



consensus 
pGAlOl 
Seq 
OGA107 
OGA102 
aa307982 



E N T G • P 
A P P A G 
1051 

CACCTCCAGC AGG.CA. 
CACCTCCAGC AGG CA 
CACCTCCAGC AGG CA 
CACCTCCAGC AGG CA 
CACCTCCAGC AGGGCAA 
CACCTCCAGC, AGGCA. . 



P 

S 



I 

M 



Y P 

T P 



G 

K 



GT ATGACACCTA 
GT ATGACACCTA 
GT ATGACACCTA 
GT ATGACACCTA 
GT ATGACACCTT 
GT ATGACACCTA 



L 

S 



G 
P 



AG . . TCGCCC 
AG TCGCCC 
AG TCGCCC 
AG TCGCCC 
AAGTTCGCCC 
AG .".TCGCCC 



P P A 
ST D 
1100 
TCCACT . GAC 
TCCACT GAC 
TCCACT GAC 
TCCACT GAC 
TCCACTTGAC 
TCCACT . GAC 



CED-6 
hCED-6 



LPLSPM 



consensus 
pGAlOl 
Seq 
OGA107 
OGA102 
aa307982 
aa443368 



P Q G 

I F D 
1101 

ATCTTTGATA 
ATCTTTGATA 
ATCTTTGATA 
ATCTTTGATA 
ATCTTTGATA 
ATCTTTG . AT 



P P P 
M I P 

. TGATTCCAT 
TGATTCCAT 
TGATTCCAT 
TGATTCCAT 
ATGATTCCCT 
ATGATTCCAT 
CCAT 



N I 
F S 



P 

? 



P 
I 



S 

s 



s 

H 



TTTC . . TCCA 
TTTC TCCA 
TTTC TCCA 
TTTC TCCA 
TTTTCTTCCA 
TTTCT . . .CC 
TTTCT CC 



AT AT . CACAC 
ATAT CACAC 
ATAT CACAC 
ATAT CACAC 
ATATTCACAC 
AATATCACAC 
AATATCACA 



I Y S 
Q S S 
1150 
C . AGTCTTC . 
C AGTCTTC 
C AGTCTTC 
C AGTCTTC 
CCAGTATTCN 
C . AGTCTTC . 
CCAGTCTTC . 



CED-6 
hCED - 6 



consensus 

pGAlOl 
Seq 
OGA10 7 
OGA10 2 
aa307982 
aa443368 
aa431995 



M P R 
MPT 
1151 

GATGCCTAC . 

GATGCCTAC 

GATGCCTAC 

GATGCCTAC 

GATGCCTTCC 

GATGCCTAC. 

GATGCCTAC. 



A N 
R N 



D L P PTE 

G T Q P P P 



TCGCAAT . . GGCACACAGC C ■• ACCTC. CA 



M A P 

V P S 

1200 
GTACCTAGTA 



primer 445 -10934 - 13 -R 



TCGCAAT 
TCGCAAT 
TCGCAAT 

TTCGCAATTG 

T . CGCAAT . . 

TCGCAAT 

. CAGCAAGTC 



GGCACACAGC 
GGCACACAGC 
GGCACACAGC 
GNACCACAGC 
GGCACACAGC 
GGCACACAGC 
AACATTTGAC 



C ACCTC CA 
C ACCTC CA 
C ACCTC CA 
CCACCTTNCA 
C . ACCTC . CA 
C ACCTC CA 
ATATAGTTAT 



GTACCTAGTA 

GTACCTAGTA 

GTACCTAGTA 

GTTCCTTAGT 

GTACCTAG . . 

GTACCTAGTA 

TTATTAGTTG 



CED-6 
hCED-6 



consensus 
pGAlOl 



TLPQ IS T SSN GA SP SVS 

R STE.IKR DLF GA EP FDP 

1201 1250 

GATCTACTGA GATTAAACGG GACCTGTTTG GAGCAGAACC TTTTGACCCA 

GATCTACTGA GATTAAACGG GACCTGTTTG GAGCAGAACC TTTTGACCCA 

SUBSTITUTE SHEET (RULE 26) 



WO 99/64586 



PCT/EP99/04043 




Seq 
OGA107 
OGA102 
aa443368 
aa431995 



GATCTACTGA GATTAAACGG GACCTGTTTG GAGCAGAACC TTTTGACCCA 
GANCTACTGA GATTAAANGG GACCTGTTTG GAGCAGAACC TTTTGACCCA 

NANG 

GATCTACTGA GATTAAACGG GACCTGTTTG GAGCAGAACC TTTTGACCCA 
ATCAAAGCAT GAATATTTCA ACTTTAGTGT TCACTGATTT TATTTTGCTG 



CED-6 
hCED-6 



consensus 

pGAlOl 
. Seq 
OGA107 
. aa443368 
aa431995 



PAS TSPS GPA 
FNC GAAD FPP 

1251' 

TTTAA CTGTG GAGCAGCAGA TTTCCCTCC A 

Primer 0GAIO8-F 
TTTAACTGTG GAGCAGCAGA TTTCCCTCCA 
TTTAACTGTG GAGCAGCAGA TTTCCCTCCA 
TTTAACTGTG GAGCAGCAGA TTTCCCTCCA 
TTTAACTGTG GAGCAGCAGA TTTCCCTCCA 
TAACATTT CATAGTC TTTTTTA CA 



P S I P P P A 

D I Q S . K h D 

1300 

GATATTCAAT CAAAATTAGA 



GAT ATTCAAT 
GATATTCAAT 
GATATTCAAT 
GATATTCAAT 
GATATTAATT 



CAAAATTAGA 
CAWAATTAGA 
CAAAATTAGA 
CAAAATTAGA 
ATTTTATTCT 



CED-6 
hCED-6 



consensus 
•pGAlOl 
' Seq 
OGA107 
OGAIOS 
aa4_43 3 68 
aa431995 



S T S 
E M Q 
1301 

TGAGATGCAG 
TGAGATGCAG 
TGAGATGCAG 
TGAGATGCAG 
G 

TGAGATGCAG 
GTTTTA . CAG 



P S G PAPS 
E G F K M G- L 

GAGGGGTTCA AAATGGGACT 
GAGGGGTTCAAAAATGGGACT 
GAGGGGTTCA AAATGGGACT 
GAGGGGTTCA AAATGGGACT 
GAGGGGTTCA AAATGGGACT 
GAGGGGTTCA AAATGGGACT 
GAGGGGTTCA AAATGGGACT 



I P P P R P 
T h E G T V 

1350 

AACTCTTGAA GGCACAGTAT 
AACTCTTGAAAGGCACAGTAT 
AACTCTTGAA GGCACAGTAT 
AACTCTTGAA GG C AC AAGTAT 
AACTCTTGAA GGCACAGTAT 
AACTCTTGAA GGCACAGTAT 
AACTCTTGAA GGCACAGTAT 



CED-6 
hCED-6 



consensus 

Seq 
6GA107 
OGA108 
aa443368 
aa431995 



P A L A 
F C L D 
1351 

TTTGTCTCGA 

TTTGTCTCGA 
TTTGTCTCGA 
TTTGTCTCGA 
TTTGTCTCGA 
TTTGTCTCGA 



P P P P V 
P h D S R 



*** ■ 

C * 



CCCGTTAGAC AGTAGGTC-CT GACATCAAGA 



primer 445-1093 4-11 -F 



1400 
ACAAGAAATC 



CCCGTTAGAC 
CCCGTTAGAC 
CCCGTTAGAC 
CCCGTTAGAC 
CCCGTTAGAC 



AGTAGGTGCT GACATCAAGA 
AGTAGGTGCT GACATCAAGA 
AGTAGGTGCT GACATCAAGA 
AGTAGGTGCT GACATCAAGA 
AGTAGGTGCT GACATCAAGA 



ACAAGAAATC 
ACAAGAAATC 
ACAAGAAATC 
ACAAGAAATC 
ACAAGAAATC 



CED-6 ...PRRNPVyS PKNSTAGLLD GLELGSAEPA KKAPSNIFDD 

CED-6 . S FD tRAGEKK STAAEYNPFG ADFLSGIQNG KEAPPSASAE LLASEAIARL PKPESSSVPP ■ 
CED-6 KKTAAEYDAM INEVEKKLAA MSSGSFEFGQ LQTGDLGGIE GESDYGTPSD RLNPKMMNLKQ 



consensus 

Seq 
OGA107 
OGA109 
■ OGA108 
•aa44 3 3 68 
aa431995 
r33389 



14 01- 

CTGATTCATG 



TTAAATGTGT 



1450 

TTGTATAC . A CATGTCATTT ATTATTATTA 



primer oGA109-F 



CTGATTCATG 
CTGATTCATG 

CTGATTCATG 
CTGATTCATG 
CTGATTCATG 
CTGATTCATG 



TTAAATGTGT 
TTAAATGTGT 

TTAAATGTGT 
TTAAATGTGT 
TTAAATGTGT 
TTAAATGTGT 



TTGTATAC A 
TTGTATAC A 

TTGTATAC A 
TTGTATAC . A 
TTGTATAC . A 
TTGTATAC . A 



CATGTCATTT 
CATGTCATTT 
ACTGTTCATT 
CATGTCATTT 
CATGTCATTT 
CATGTCATTT 
CATGTCATTT 



ATTATTATTA 

ATTATTATTA 

ATTATTATT 

ATTATTATTA 

ATTATTATTA 

ATTATTATTA 

ATTATTATTA 



consensus 
Seq 
OGA107 
OGA109 
OGA108 
aa443368 



1451 1500 
CTTTAAGATA GGTATTA.TT CATGTGTCAA TGTTTTTGAA TATTTTAATA 
CTTTAAGATA GGTATTA TT CATGTGTCAA TGTTTTTGAA TATTTTAATA 
CTTTAAGATA GGTATTA TT CATGTGTCAA TGGTTTTTTGAATATTTTAATA 
CTTTANNAA GGTTATTATT NTGCGNTCA GNTTTTNTAA TATTTTAATA 
CTTTAAGATA GGTATTA TT CATGTGTCAA TGTTTTTGAA TATTTTAATA 
CTTTAAGATA GGTATTA TT CATGTGTCAA TGTTTTTGAA TATTTTAATA 



SUBSTITUTE SHEET (RULE 26) 



WO 99/64586 



PCT/EP99/04043 



aa431995 
r53881 
r62236 
hO 3 74 9 
r33389 



CTTTAAGATA GGTATTA TT CATGTGTCAA TGTTTTTGAA TATTTTAATA 

; . . GATA 

.... AAGATA GGTATTA . TT CATGTGTCAA TGTTTTTGAA TATTTTAATA 
."- . TAAGATA GGTATTA. TT CATGTGTCAA TGTTTTTGAA TATTTTAATA 
CTTTAAGATA GGTATTA. TT CATGTGTCAA TGTTTTTGAA TATTTTAATA 



consensus 
Seq 
OGA10 9 
OGA108 
aa443368 
aa431995 
r53881 
r62236 
h03749 
r33389 



1501 

TTTTGAAAAT 
TTTTGAAAAT 
TTTNTAAAAT 
TTTTGAAAAT 
TTTTGAAAAT 
TTTTGAAAAT 
TTTTGAAAAT 
TTTTGAAAAT 
TTTTGAAAAT 
TTTTGAAAAT 



TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCANTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAGTT 
TTTCTCAGTT 



AAATTTCCT 
AAATTTCCT 
AAATTTCCT 
AAATTTCCT 
AAATTTCCT 
AAATTTCCT 
AAATTTCCT 
AAATTTCCT 
AAATTTCCT 
AAATTTCCT 



CACCT. . 
CACCT 
CACCT 
CACCT 
CACCT 
CACCT 
CACCT. . 
CACCT. . 
CACCT. . 
CACCT- . 



1550 

.T CACTATTGAT 

T CACTATTGAT 

T CACTATTMNAT 

T CACTATTGAT 

T CACTATTGAT 

T CACTATTGAT 

"T CACTATTGAT 

.T CACTATTGAT 

.t CACTATTGAT 

'.T CACTATTGAT 



consensus 



1551 

CTGTAATTTT 



TATTTTAAAA ACAGCTTACT G , . . TAAAGT 



1600 
AGA . . TCATA 



primer 44 5 - 10934 - 03 -R 



Seq 


CTGTAATTTT 


TATTTTAAAA 


ACAGCTTACT 


G 


TAAAGT 


AGA 


TCATA 


OGA109 


CGTTAATTTT 


TATTTTAAAA 


ACNTCTTACN 


T 


TAANTT 


NNA 


TCATA 


OGA10 8 


CTGTAATTTT 


TATTTTAAAA 


ACAGCTTACT 


G 


TAAAGT 


AGA 


TCATA 


aa443363 


CTGTAATTTT 


TATTTTAAAA 


ACAGCTTACT 


GT 








aa431995 


CTGTAATTTT 


TATTTTAAAA 


ACAGCTTACT 


G 


TAAAGT 


AG A 


TCATA 


r53881 


CTGTAATTTT 


TATTTTAAAA 


ACAGCTTACT 


G . . 


. TAAAGT 


AG A . 


TCATA 


r62236 


CTGTAATTTT 


TATTTTAAAA 


ACAGCTTACT 


G. \ 


.TAAAGT 


AG. A. 


TCATA 


. h03749 


CTGTAATTTT 


TATTTTAAAA 


ACAGCTTACT. 


G. . 


.TAAAGT 


AG. A. 


TCATA 


r33389 


CTGTAATTTT 


TATTTTAAAA 


ACAGCTTACT " 


G. . 


. TAAAGT 


AGGA . 


TCATA 



consensus 
Seq 
OGA10 9 
OGA108 
aa431995 
r53881 
r62236 
h03749 
r33389 



1601 

CTTTT . . A TG 
CTTTT ATG 
CTTTT ANN 
CTTTT ATG 
CTTTT ATG 
CTTTT, .ATG 
CTTTT. .ATG 
CTTTT . . ATG 
CTTTT . . ATG 



TTCCTTTCTG 
TTCCTTTCTG 
TTCCTTTCGA 
TTCCTTTCTG 
TTCCTTTCTG 
TTCCTTTCTG 
TTCCTTTCTG 
TTCCTTTCTG 
TTCCTTTCTG 



TTTCTACTGT 

TTTCTACTGT 

TTTCTACGC 

TTTCTACTGT 

TTTCTACTGT 

TTTCTACTGT 

TTTCTACTGT 

TTTCTACTGT 

TTTCTACTGT 



AGAT 
AGAT 
TNNA 
AGAT 
AGAT 
AGAT 



GAAT 
GAAT 
GNAA 
GAAT 
GAAT 
GAAT 



AGAT . . GAAT 
AGAT . . GAAT 
AGGATGGAAT 



1650 
TTGTAATTGA 
TTGTAATTGA 
TTGNTAATTATA 
TTGTAATTGA 
TTGTAATTGA 
TTGTAATTGA 
TTGTAATTGA ' . 
TTGTAATTGA 
TTGTAATTGG 



consensus 

Seq 
OGA10 9 
OGA108 
aa431995 
r53 881 
r622 3 6 
h03749 
r33389 



1651 

AAG . ACATAT 

AAG ACATAT 
ANT ACATAT 
AAG ACATAT 
AAG ACATAT 
AAG . ACATAT 
AAG . ACATAT 
AAG . ACATAT 
AAGGACATAT 



TATACAAA TA CCTGCOTTGT GTCTGAG . TT 

primer 445-109 34-0 6-F 
TATACAAATA CCTGCCT.TGT GTCTGAG TT 
TATACAAATA CCGACCTTANGATCTNNNTTT 
TATACAAATA CCTGCCTTGT GTCTGAG. TT 
TATACAAATA CCTGCCTTGT GTCTGAG TT 
TATACAAATA CCTGCCTTGT GTCTGAG . TT 
TATACAAATA CCTGCCTTGT GTCTGAG. TT 
TATACAAATA CCTGCCTTGT GTCTGAG . TT 
TATACAAATA CCTGCCTTGT GTCTGAGGTT 



/ ■ 1700 
CTATTTAGTT 

CTATTTAGTT 
CTATTTATTT 
CTATTTAGTT 

CTATTTAGTT 
CTATTTAGTT 
CTATTTAGTT 
CTATTAGGTA 



consensus 

Seq 
OGA109 
OGA108 
OGA110 



1701 1750 
AGC.ATCTTG AAATTT GTAT TCATTTTCCA GATGGCTAG T TTATTAATGA 

Primer oGAHO-F 
AGC ATCTTG NAATTTGTAT TCATTTTCC AG GATGGCTAGT TTATTAATGA 
NTC ATCTGT AAATTGATAT TCATTTTCCA TAGGNCTGTTTTATTAAGNAT 
AGC ATCTTG AAATTTGTAT TCATTTTCCA GATGGCTAGT TTATTAATGA 

CTTTAATGA 

SUBSTITUTE SHEET (RULE 26) 



WO 99/64586 



PCT/EP99/04043 



r53881 
r62236 
h0374 9. 
r33389 




AGC.ATCTTG AAATTTGTAT TCATTTTCCA GATGGCTAGT TTATTAATGA 
AGC.ATCTTG AAATTTGTAT TCATTTTCCA GATGGCTAGT TTATTAATGA 
AGC.ATCTTG AAATTTGTAT TCATTTTCCA GATGGCTAGT TTATTAATGA 
GGCCATCTGG AAATTTGTAT TCATT 



consensus 

Seq 
OGA109 
0GAIO8 
oGAHO 
r53881 
r62236 
h03749 



1751 

TTTCCCAAAA GCCATACCTT AAAG . ATAAC 



TTTTTAAATT 
primer 445 



1800 
CTGAAGA . .G 
10934-12-R 



TTTCCCAAAA GCCATACCTT 
TTCCCAAAN TCCATACCTT 
TTTCCCAAAA GCCATACCTT 
TTTCCCAAAA GCCATACCTT 
TTTCCCAAAA GCCATACCTT 
TTTCCCAAAA GCCATACCTT 
TTTCCCAAAA GCCATACCTT 



AAAG ATAAC 
AANT ATAAC 
AAAG ATAAC 
AAAG ATAAC 
AAAG . ATAAC 
AAAG . ATAAC 
AAAGGATAAC 



TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 



CTGAAGA G 
TNTAATA T 
CTGAAGA G 
CTGAAGA G 
CTGAAGG . . G 
CTGGAAG A . G 
CTGGAAGGNG 



consensus 
Seq 
OGA109 
OGA110 
OGA1 0 8 
r53881 
f62236 
h03749 



1801 

ACATGCCAAT ' G TCAAACTAA 
ACATGCCAAT GTCAAACTAA 
ACANTCCAA GTTCAAACTAA 
ACATGCCAAT GTCAAACTAA 
ACATGCCAAT GTCAAACTAA 
ACATGCCAAT GTCAAACTAA 
ACATGCCAAT GTCAAACTAA 
ACATGCCAAT GTCAAACTAA 



ACATGTTCTG 
ACATGTTCTG 
ACANNTTCGN 
ACATGTTCTG 
ACATGTTCTG 
ACATGTTCTG 
ACATGTTCTG 
ACATGTTCCG 



TTTTTAAA . C 
TTTTTAAA C 
TTTTTAAA C 
TTTTTAAA C 
TTTTTAAA C 
TTTTTAAA . C 
TTTTTAAAAC 
TTTTTAAAAC 



1850 
CAACAAACAT 
CAACAAACAT 
CAACAAACAN 
CAACAAACAT 
CAACAAACAT 
CAACAAACAT 
CAACAAACAT 
CAACAAACAT 



consensus 
;,- Seq 
OGA109 
0GAIO8 
6GA110 
r53 881 
r62236 
h03749 



1851 

GTTA . CTATT 
GTTA CTATT 
NTTA CTATT 
GTTA CTATT 
GTTA CTATT 
GTTA. CTATT 
GTTAACTATT 
GTTA . CTATT 



CATTGG . ACA 
CATTGG ACA 
CATGNGNACA 
CATTGG ACA 
CATTGG ACA 
CATTGGGACA 
TCATGGGACA 
TCATG 



GATATCATTT 
GATATCATTT 
NATATCATTT 
GATATCATTN 
GATATCATTT 
GNTATCCTTT 



TATG 
TATG 
TANA 
NATG 
TATG 



TATA 
TATA 
TATA 
TATA 
TATA 



TATGGTATTA. 



1900 
AATACTGTT . 
AATACTGTT 
AACACTANT ' 
AAT 

AATACTGTT 
AATACTGTTC 



consensus 
Seq 
oGA109 
oGAHO 
.,. r53881 



1901 1950 
CACATCACTG G . GAAAATGT AAACTTT . AA ACATAATGCC ACAAGGTCAC 
CACATCACTG G GAAAATGT AAACTTT AA ACATAATGCC ACAAGGTCAC 
TCACATCACTG GGTAAAAGAT AANCTTT AA ACATAATACCCACANGTTCAC 
CACATCACTG G GAAAATGT AAACTTT AA ACATAATGCC ACAAGGTCAC 
CACCTCACCG GGGGNATGGT AAACCTTNAA ACCTNATGGC CNCAGGGGCA 



consensus 
Seq 
6GA109 
oGAl 1 0 
aa431753rcc 
r53881 



1951 

TAATTTCTAG 
TAATTTCTAG 
TAATTTCTAA 
TAATTTCTAG 

CCNTTTTNCG 



CAGGTAAAAT 
CAGGTAAAAT 
CNGATNAAAT 
CAGGTAAAAT 
. . GGTAAAAT 
GCG 



TATAAGGATA TAAATTCCAA 
TATAAGGATA TAAATTCCAA 
TATANGGNTATAAAATTCCAA 
TATAAGGATA TAAATTCCAA 
TATAAGGATA TAAATTCCAA 



2000 
T AATAAAC C A 
TAATAAACCA 
TAATAAACCCA 
TAATAAACCA 
TAATAAACCA 



consensus 



pGAlOl 
Seq 
OGA109 
oGAHO 
aa431753rcc 
aal59297rcc 



2001 2050 
AATGTATTTA GAGTATTTAT TAGTAAATG C AAGGTGATGT TAGTTATGAT 

primer 445-10934-01 R 
primer 445 -10934-10 -F 

AGTAAATGCCAAGGTGATGGTTAGTTAAGGAT 
AATGTATTTA GAGTATTTAT TAGTAAATGC AAGGTGATGT TAGTTATGAT 
AAG AT ATTT AAG AAT ATTTATTT ANTAACTG C CAGNTGAA 
AATGTATTTA GAGTATTTAT TAGTAAATGC AAGGTGATGT TAGTTATGAT 
AATGTATTTA GAGTATTTAT TAGTAAATGC AAGGTGATGT TAGTTATGAT 
. : t GATGAT 

SUBSTITUTE SHEET (RULE 26) 



WO 99/64586 



PCT/EP99/04043 




2051 

consensus CAGTTATACT CTAAATATTT AATTTGTTTT 
pGAlO 1 CAGGTTAAAACCTCTAAAATATTNAATNTTGTTTT 
Seq CAGTTATACT CTAAATATTT AATTTGTTTT 
CAGTTATACT CTAAATATTT AATTTGTTTT 
CAGTTATACT CTAAATATTT AATTTGTTTT 
CAGTTATACT CNAAATATTN AATTTGTNTT 
.... TATACT CTAAATATTT AATTTGTTTT 
T 



oGAHO 
aa431753rcc 
aal59297rcc 
aa770228rcc 
h02853rcc 



ATAAAGGTAG 
ATAAAGGTAG 
ATAAAGGTAG 
ATAAAGGTAG 
ATAAAGGTAG 
ATAAAGGTAG 
ATAAAGGTAG 
ATAAGGGTAG 



2100 
TGAAAAAATG 
GAAAAAATG 
TGAAAAAATG 
TGAAAAAATG 
TGAAAAAATG 
TGAAAAAATG 
TGAAAAAATG 
NGAAAAAANG 



consensus 

pGAlOl 
Seq 
OGA110 
aa431753rcc 
aal59297rcc 
aa770228rcc 
h02853rcc 
d60819rcc 
r62135rcc 



2101 

AAAATTTGCT ATTTATTAAA AAACATTAAA TTTC. ATTCC 



2150 
. AAATG AG AT 



primer 445 -10934-05-F 



AAAATTTGCT 
AAAATTTGCT 
AAAATTTGCT 
AAAATTTGCT 
AAAATTTGCT 
AAAATTGGCT 
AAAATTNGCT 



ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
ATTTATTAAA 
. ATTATKRAA 



AAACATTAAA 
AAACATTAAA 
AAACATTAAA 
AAACATTAAA 
AAACATTAAA 
AAACATTGAA 
AAACATTAAA 
AAACATTAAA 
AAACATTAAA 



TTTC ATTCC 
TTTC ATTCC 
TTTC ATTCC 
TTTC. ATTCC 
TTTC. ATTCC 
TTTC. ATTCC 
TTTC . ATTCC 
TKTC.ATBCS 
TGTCCANGCC 



AAATGAGAT 
AAATGAGAT 
AAATGAGAT 
. AAATGAGAT 
. AAATGAGAT 
. AAATGAGAT 
CAAATGAGAT 
. AAATGAGAT 
CAAATGAGAT 





2151 


















. 2200 


consensus 


AAGTG 


. ATAT 


TAC 


. TATAAC 


ATC 


. TAAGCA 


TCATCT . 


.GA 


TTTG 


. ATATT 


pGAlOl 


AAGTG 


ATAT 


TAC 


TATAAC 


ATC 


TAAGCA 


TCATCT 


GA 


TTTG 


ATATT 


Seq 


AAGTG 


ATAT 


TAC 


TATAAC 


ATC 


TAAGCA 


TCATCT 


GA 


TTTG 


ATATT 


. oGAHO 


AAGTG 


ATAT 


TAC 


TATAAC 


ATC 


TAAGCA 


TCATCT 


GA 


TTTG 


ATATT 


aa431753rcc 


AAGTG 


. ATAT 


TA.C 


.TATAAC 


ATC 


. TAAGCA 


TCATCT . 


.GA 


TTTG 


. ATATT 


aal59297rcc 


AAGTG 


.ATAT 


TAC 


. TATAAC 


ATC 


. TAAGCA 


TCATCT . 


.GA 


TTTG 


. ATATT 


aa770228rcc 


AAGTG 


.ATAT 


TAC 


.TATAAC 


ATC 


. TAAGCA 


TCATCT . 


.GA 


TTTG 


. ATATT 


h02853rcc 


AAGTG 


.ATAT 


TACCTATAAC 


ATCCTAAGCA 


TCATCT . 


.GA 


TTTG 


. ATANT 


d€0819rcc 


AAGTG 


.ATAT 


TAC 


. TATAAC 


ATC 


. TAAGCA 


TCATCT . 


.GA 


TTTG 


. ATATY 


r62135rcc 


AAGTGGATAJN 


TACCTATAAC 


ATCCTAAGCA 


TCATCCTGNA 


TTTGNANANT 



consensus 
pGAlOl 
Seq 
oGAHO 
aa431753rcc 
aal59297rcc 
aa770228rcc 
h02853rcc 
d60819rcc 
r62135rcc 



2201 

CCCT . AAAAA 
CCCT AAAAA 
CCCT AAAAA 
CCCT AAAAA 
CCCT . AAAAA 
CCCT . NAAAA 
CCCT . AAAAA 
CCCT . AAAAA 
CCCTRAAAAA 
CCCCTNAAAA 



ACATTTGGAA 
ACATTTGGAA 
ACATTTGGAA 
ACATTTGGAA 
ACATTTGGAA 
ACATTTGGAA 
ACATTTGGAA 
ACATTTGGAA 
ASATKTGGRA 
ACATTTGGNA 



TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 



CTATAGATTC 
CTATAGATTC 
CTATAGATTC 
CTATAGATTC 
CTATAGATTC 
CTATAGATTC 
CTATAGATTC 
CTATAGATTC 
CTATAGAKTC 
CTATAGATTC 



2250 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 



consensus 
pGAl 0 1 
Seq 
oGAHO 
aa431753rcc 
aal59297rcc 
aa770228rcc 
h02853rcc 
d60819rcc 
r62135rcc 



2251 

ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 



ACTTTACC . A 
ACTTTACC A 
ACTTTACC A 
ACTTTACC A 
ACTTTACC . A 
ACTTTACC . A 
ACTTTACC . A 
ACTTTACC . A 
ACTTTACSSA 
ACTTTACC . A 



AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 



TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 



230 0 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 



2301 



SUBSTITUTE SHEET (RULE 26) 



2350 



WO 99/64586 



PCT/EP99/04043 




consensus 
pGAlOl 
Seq 
oGAHO 
aa431753rcc 
aa!592 97rcc 
aa770228rcc 
h02853rcc 
d60819rcc 
r62135rcc 



CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCNTCTCAT 



ATTTTCTTCT 
ATTTTCTTCT 

ATTTTCTTCT 
ATTTTCTTCT 
ATTTTCTTCT 
ATTTTCTTCT 
ATTTTCTTCT 
ATTTTCTTCT 
ATTTTCTTCT 
ATTTTCTTCT 



TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 



TATTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 



CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 



consensus 
pGAlpl 
. . ; Seq 
OGA110 
aa4 31753rcc 
aal592?7rcc 
aa770228rcc 
h02853rcc 
d60819rcc 
r62135rcc 



2351 

TTTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 



CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGNT 



TTGATACTTT AAAATCTGTG 
TTGATACTTT AAAATCTGTG 
TTGATACTTT AAAATCTGTG 
TTGATACTT AAAAATCTGTG 
TTGATACTTT AAAATCTGTG 
TTGATACTTT AAAATCTGTG 
TTGATACTTT AAAATCTGTG. 
TTGATACTTT AAAATCTGTG 
TTGATACTTT AAAATCTGTG 
TTGATACTTT AAAATCTGTG 



2400 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 



consensus 
pGAib.l 
V^Seq 
OGAIIO 
aa43i753rcc 
aal59297rcc 
aa770228rcc 
h02853rcc 
d608i9rcc 
r62135rcc 



2401 

TACATGAATT 
TACATGAATT 
TACATGAATT 
TACATGAATT 
TACATGAATT 
TACATGAATT 
TACATGAATT 
TACATGAATT 
TACATGAATT 
TACATGNATT 



ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 



GGTAAATTCA 
GGTAAATTCA 
GGTAAATTCA 
GGTAA TTCA 
GGTAAATTCA 
GGTAAATTCA 
GGTAAATTCA 
GGTAAATTCA 
GGTAAAKTCA 
GGTAAATTCA 



ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 



. .2450 
GTTTTGTTAA 
GTTTTGTTAA 
GTTTTGTTAA 
GTTTGGTAAA 
GTTTTGTTAA 
GTTTTGTTAA 
GTTTTGTTAA 
GTTTTGTTAA 
GTTTTGTTAA 
GTTTTGTTAA 



            2451                                            2498
consensus   AGTCAAAAAT CTCATTTTCC
pGA101      AGTCAAAAAT CTCATTTTCC AGTCGACGCG GCCGC
Seq         AGTCAAAAAT CTCATTTTCC AAAAAAAAAA AAAAAAAACT CGAG
oGA110      ATCCAAAAATNNNCATT
aa431753rcc AGTCAAAAAT CTCATTTTCC AAAA
aa159297rcc AGTCAAAAAT CTCATTTTCC
aa770228rcc AGTCAAAAAT CTCATTT
h02853rcc   AGTCAAAAAN NTCAANNTCC
d60819rcc   AGTCRAAAAT CTCATTTTCC
r62135rcc   AGTNANNANT CTCATTTTCC AANANGGGGG GGGGGGGGGA AGTTCCTG






GGTGATGAGCCCTTGGGTTCTCGCTCCGACTGCTAAATTCGCTTGGCCGGGTCCACCTTCTCGTGGCCT 

cactcgcgacacggatcagaatccggagcaggcagttctctctattctgaggctcctgcggctgccgcg 

CTGACTTCCCTGTGTGGNGGAGGGAACTCTGGGCAGGCTGGTTTTCTTGGAATGTGTTTACGATGTTGA 

ATGGGACTTGAACAGGA.\GCTGGACGCTGCAGCTGGAACTAGCGTGCCAAGTTATTTATGATTCCATC 

TGATATACATAGGAGAG^ACTGATAGAAGAATTCTGATGGCAACTGTATGATAGAAGCTATATAAAG 

TCAAGTGTCCATTTTCTTTCAACTATATTTGAGCATACCCAGGATTTAAGTCGTGGAAC 

TTGGCTGATCCTCATCMSAACCGTGCTTTTAGCAGGAAGAAAGACAAAACATGGATGCATACACCTG 

AAGCTTTATCAAAACATTTCATTCCCTATAATGCAAAGTTTCTTGGCAGTACAGAAGTC 

AAGGAACAGAAGTTGTGAGAGATGCTGTAAGGAAACTAAAGTTTGCAAGACATATCAAGAAATCTGA 

AGGCCAGAAAATTCCTAAAGTGGAGTTGCAAATATCAATTTATGGAGT^ 

CAAAGGAAGTTCAACACAATTC 

AGGATATTGACTTTCATATGCAAAGATTCTGAGTCAAATAAACAlTrGTGCTATGTATTTGACAGCGAA 

AAGTGTG CTGAAGAGATCACTTTAACAATTGGCCAAGCATTTGACCTGGCATACAGGAAATTTCTAGA 

ATCAGGAGGAAAAGATGTTGAAACAAGAAAACAGATCGCAGGGTTACAAAAAAGAATCCAAGACTTA 

GAAACAGAAAATATGGA.\CTTAAAAATAAAGTACAAGATTTGGAAAACCAACTGAGAATAACTCAAG 

TATCAGCACCTCCAGCAGGCAGTATGACACCTAAGTCGCCCTCCACTGACATCTTTGATATGATTCCAT 

TTTCTCCAATATCACACCAGTCTTCGATGCCTACTCGCAATGGCACACAGCCACCTCCAGTACCTAGTA 

GATCTACTGAGATTAAACGGGACCTGTTTGGAGCAGAACCTTTTGACCCATTTAACTGTGGAGCAGCA 

GATTTCCCTCCAGATATTCAATCAAAATTAGATGAGATGCAGGAGGGGTTCA J \AATGGGACTAACTCT 

TGAAGGGACAGTATTTTGTCTCGACCCGTTAGACAGTAGGTGCTGACATCAAGAACAAGAAATCCTGA 

TTCATGrTTAAATGTGTTTGTATAC 

.TCAATGTTTTTGAATATTTTAATATTTT 

GTAATTTTTATTTTAAAAACAGCITACTCT 

AGATGAATTTGTAATTGAAAGACATATTATACAAATACCTGCCTTGTGTCTGAGTTCTATTTAG 

ATCTTGAAATTTGTATTCATTTTCCAGATGGCTAGTTTATTAATGATT^ 

ATAACTTTTTAAATTCTGAAGAGACATGCCAA^ 

CATGTTA CTATTC ATTGG AC AG ATATC ATTTT ATGTAT AA ATACTGTTC A C ATC ACTGGG A AAATGTA A 
ACTTTAAACATAATGCCACAAGGTCAGTAATTTCTAGCAGGTAAAATTATAAGGATATAAATTCCAATA 
ATAAACCAAATGTATTTAGAGTATTTATTAGTAAATGCAAGGTG 
TAAATATTTAATTTGTTTTATAAAGGTAGTGAAAAAATGAAAATT^ 

TTTCATTCCAAATGAGATAAGTGATATTACTATAACATCTAAGCATCATCTGATTTGATATTCCCTAAA 
AAACATTTGGAATATATGCTATCTATAGA 

CTCCTCACTGCATAAGGACTACTCTTCTCATATTTTCTTCTTTGA 

TTTTGTGATGCCCTCTTGGTTTTGATACTTTAAAATCTGTGGCACCC 

TGGTAAATtCAATCTGTATTTGTTTTGTTAAAGTC 

GTCGAG 






///re 




GGTGATGAGCCCTTGGGTTCTCGCTCCGACTGCTAAATTCGCTTGGCCGGGTCCACCTTCTCGTGGCCT 

CACTCGCCACACdGATCAGAATCCGGAGCAGGCAGTTCTCTCTATTCTGAGGCrCCTGCGGCTGCCGCG 

GTGACTTCCCTGTGTGGNGGAGGGAACTCTGGGCAGGCTGGTTTTCTTGGAATGTGTTTACGATC 

ATGGGACTTGAACAGGAAGCTGGACGCTGCAGCTGGAACTAGCGTGCCAAGTTATTTATGATTC 

TGATATACAfAGGAGAGA.\ACTGATAGAAGAATTCTGATGGCAACTGTATGATAGAAGCTATATAAAG 

TCAAGTGTCCATTTTCTTTCAACTATATTTGAGCATACCCACKj 

TTGGCTGATCCTCATC ATG .AACCGTGCTTTTAGCAGGAAGAAAGACAAAACATGGATGCATACACCTG 
AAGCTTTATCAA^\CATTTCATTCCCTATAATGCAAAGTTTCTTGGCAGTACAGAAGTGGAACAG^ 

aaggaacagaXgttgtgagagatgctgtaaggaaactaaagtttgcaagacatatcaagaaatctga 

aggccagaaMttcctaaagtggagttgcaaatatcaatttatggagtaaaaattct^ 

caaaggctgaagagatcactttaacaattggccaagcaittgacctggcatacaggaaatttctaga 

tcaggaggaaaAgatgttgaaacaagaaaacagatcgcagggttacaaaaaagaatccaagactt 

aaacagaaaatatggaacttaaaaat^ 

atcagcacctccagcaggcagtatgacacctaagtcgccctccactgacatctttg 

ttctccaatatca6ac 

atctactgaga^ 

atttccctccagatattcaatc 

GAAGGCAfcAGTA^ 

TCATGTTAAATCTGTIT^ 

CAATGTTTTTGA^TTTTAATATTTT 

TAATTTTTATTTTAAAAACAGCTTACTGTAAAGTAGATCA 

GATGAATTTGT^TO 
TCTTGAAATTTGTATT^ 
TAACTTTT[TAAATTCT 
ATGTTACTATTCATTGGACAGATATCA 

CTTTAAACAtAATGCCACAAGGTCACTAATTTCTAGCAGGTAAAATTATAA 
TAAACCAAATGTAiTrAGAGTAl^ 

aaatatttaatttgttitataaagg 

ttcattgcaAAtgagataagtgatattactataacatctaagcatcatctgatttgatattccct 

AACA^GG/L^^^ 

tcctcactggatXaggactactcttctcatattttcttctt^ 

TTTGTGATGCCCTCITGGTTt^ 

GGTAAATTCAATCTGTATTTGTTTTGTTAAAGTCAAAAATCTCATTTT 
TCGAG 
MNRAFSRKKDKTWMHTPEALSKHFIPYNAKFLGSTEVEQPKGTEVVRDAVRKLKFARHIKKS
EGQKIPKVELQISIYGVKILEPKTKEVQHNCQLHRISFCADDKTDKRIFTFICKDSESNKHLCYV
FDSEKCAEEITLTIGQAFDLAYRKFLESGGKDVETRKQIAGLQKRIQDLETENMELKNKVQDLE
NQLRITQVSAPPAGSMTPKSPSTDIFDMIPFSPISHQSSMPTRNGTQPPPVPSRSTEIKRDLFGAEP
FDPFNCGAADFPPDIQSKLDEMQEGFKMGLTLEGTVFCLDPLDSRC*




MNRAFSRKKDKTWMHTPEALSKHFIPYNAKFLGSTEVEQPKGTEVVRDAVRKLKFARHIKKS
EGQKIPKVELQISIYGVKILEPKTKAEEITLTIGQAFDLAYRKFLESGGKDVETRKQIAGLQKRIQ
DLETENMELKNKVQDLENQLRITQVSAPPAGSMTPKSPSTDIFDMIPFSPISHQSSMPTRKGTQPP
PVPSRSTEIKRDLFGAEPFDPFNCGAADFPPDIQSKLDEMQEGFKMGLTLEGTVFCLDPLDSRC*
Full length Ced-6 (700 bp)

Splice variant Ced-6 (500 bp) 
Artefact (450 bp) 
SQ SEQUENCE 4729 BP

TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 

60 

CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC CCCGCCCATT 

120 

GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC ATTGACGTCA 

180 

ATGGGTGGAG TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC 

240 

AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 

300 

CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 

360 

CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG ACTCACGGGG 

420 

ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC AAAATCAACG 

480 

GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT 

540 

ACGGTGGGAG GTCTATATAA GCAGAGCTGG TTTAGTGAAC CGTCAGATCC GCTAGCGCTA 

600 

CCGGACTCAG ATCTCGAGCT CAAGCTTCGA ATTCTGCAGT CGACGGTACC GCGGGCCCGG 

660 

GATCCATCGC CACCATGGTG AGCAAGGGCG AGGAGCTGTT CACCGGGGTG GTGCCCATCC 

720 

TGGTCGAGCT GGACGGCGAC GTAAACGGCC ACAAGTTCAG CGTGTCCGGC GAGGGCGAGG 

780 

GCGATGCCAC CTACGGCAAG CTGACCCTGA AGTTCATCTG CACCACCGGC AAGCTGCCCG

840

TGCCCTGGCC CACCCTCGTG ACCACCCTGA CCTACGGCGT GCAGTGCTTC AGCCGCTACC

900 

CCGACCACAT GAAGCAGCAC GACTTCTTCA AGTCCGCCAT GCCCGAAGGC TACGTCCAGG 

960 

AGCGCACCAT CTTCTTCAAG GACGACGGCA ACTACAAGAC CCGCGCCGAG GTGAAGTTCG 

1020 

AGGGCGACAC CCTGGTGAAC CGCATCGAGC TGAAGGGCAT CGACTTCAAG GAGGACGGCA 

1080 

ACATCCTGGG GCACAAGCTG GAGTACAACT ACAACAGCCA CAACGTCTAT ATCATGGCCG

1140 

ACAAGCAGAA GAACGGCATC AAGGTGAACT TCAAGATCCG CCACAACATC GAGGACGGCA 
1200

GCGTGCAGCT CGCCGACCAC TACCAGCAGA ACACCCCCAT CGGCGACGGC CCCGTGCTGC
1260

TGCCCGACAA CCACTACCTG AGCACCCAGT CCGCCCTGAG CAAAGACCCC AACGAGAAGC 

1320 

GCGATCACAT GGTCCTGCTG GAGTTCGTGA CCGCCGCCGG GATCACTCTC GGCATGGACG 

1380 

AGCTGTACAA GTAAAGCGGC CGCGACTCTA GATCATAATC AGCCATACCA CATTTGTAGA 

1440 

GGTTTTACTT GCTTTAAAAA ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA 

1500 
TGCAATTGTT GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG

1560 

CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA 

1620 

ACTCATCAAT GTATCTTAAG GCGTAAATTG TAAGCGTTAA TATTTTGTTA AAATTCGCGT 

1680 

TAAATTTTTG TTAAATCAGC TCATTTTTTA ACCAATAGGC CGAAATCGGC AAAATCCCTT 

1740 

ATAAATCAAA AGAATAGACC GAGATAGGGT TGAGTGTTGT TCCAGTTTGG AACAAGAGTC 

1800 

CACTATTAAA GAACGTGGAC TCCAACGTCA AAGGGCGAAA AACCGTCTAT CAGGGCGATG 

1860 

GCCCACTACG TGAACCATCA CCCTAATCAA GTTTTTTGGG GTCGAGGTGC CGTAAAGCAC 

1920 

TAAATCGGAA CCCTAAAGGG AGCCCCCGAT TTAGAGCTTG ACGGGGAAAG CCGGCGAACG 
1980

TGGCGAGAAA GGAAGGGAAG AAAGCGAAAG GAGCGGGCGC TAGGGCGCTG GCAAGTGTAG
2040

CGGTCACGCT GCGCGTAACC ACCACACCCG CCGCGCTTAA TGCGCCGCTA CAGGGCGCGT 

2100 

CAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC
2160

ATTCAAATAT GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA

2220 

AAAGGAAGAG TCCTGAGGCG GAAAGAACCA GCTGTGGAAT GTGTGTCAGT TAGGGTGTGG
2280

AAAGTCCCCA GGCTCCCCAG CAGGCAGAAG TATGCAAAGC ATGCATCTCA ATTAGTCAGC
2340

AACCAGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA AGTATGCAAA GCATGCATCT

2400

CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC
2460 

CAGTTCCGCC CATTCTCCGC CCCATGGCTG ACTAATTTTT TTTATTTATG CAGAGGCCGA 

2520 

GGCCGGCTCG GCCTCTGAGC TATTCCAGAA GTAGTGAGGA GGCTTTTTTG GAGGCCTAGG 
2580

CTTTTGCAAA GATCGATCAA GAGACAGGAT GAGGATCGTT TCGCATGATT GAACAAGATG
2640

GATTGCACGC AGGTTCTCCG GCCGCTTGGG TGGAGAGGCT ATTCGGCTAT GACTGGGCAC
2700

AACAGACAAT CGGCTGCTCT GATGCCGCCG TGTTCCGGCT GTCAGCGCAG GGGCGCCCGG

2760

TTCTTTTTGT CAAGACCGAC CTGTCCGGTG CCCTGAATGA ACTGCAAGAC GAGGCAGCGC

2820

GGCTATCGTG GCTGGCCACG ACGGGCGTTC CTTGCGCAGC TGTGCTCGAC GTTGTCACTG
2880

AAGCGGGAAG GGACTGGCTG CTATTGGGCG AAGTGCCGGG GCAGGATCTC CTGTCATCTC 

2940 

ACCTTGCTCC TGGCGAGAAA GTATCCATCA TGGCTGATGC AATGCGGCGG CTGCATACGC 

3000 

TTGATCCGGC TACCTGCCCA TTCGACCACC AAGCGAAACA TCGCATCGAG CGAGCACGTA 

3060 

CTCGGATGGA AGCCGGTCTT GTCGATCAGG ATGATCTGGA CGAAGAGCAT CAGGGGCTCG 
3120

CGCCAGCCGA ACTGTTCGCC AGGCTCAAGG CGAGCATGCC CGACGGCGAG GATCTCGTCG 

3180 
TGACCCATGG CGATGCCTGC TTGCCGAATA TCATGGTGGA AAATGGCCGC TTTTCTGGAT

3240 

TCATCGACTG TGGCCGGCTG GGTGTGGCGG ACCGCTATCA GGACATAGCG TTGGCTACCC 

3300 

GTGATATTGC TGAAGAGCTT GGCGGCGAAT GGGCTGACCG CTTCCTCGTG CTTTACGGTA 

3360 

TCGCCGCTCC CGATTCGCAG CGCATCGCCT TCTATCGCCT TCTTGACGAG TTCTTCTGAG 

3420 

CGGGACTCTG GGGTTCGAAA TGACCGACCA AGCGACGCCC AACCTGCCAT CACGAGATTT 

3480 

CGATTCCACC GCCGCCTTCT ATGAAAGGTT GGGCTTCGGA ATCGTTTTCC GGGACGCCGG 

3540 

CTGGATGATC CTCCAGCGCG GGGATCTCAT GCTGGAGTTC TTCGCCCACC CTAGGGGGAG 

3600 

GCTAACTGAA ACACGGAAGG AGACAATACC GGAAGGAACC CGCGCTATGA CGGGAATAAA 

3660 

AAGACAGAAT AAAACGCACG GTGTTGGGTC GTTTGTTCAT AAACGCGGGG TTCGGTCCCA 

3720 

GGGCTGGCAC TCTGTCGATA CCCCACCGAG ACCCCATTGG GGCCAATACG CCCGCGTTTC 

3780 

TTCCTTTTCC CCACCCCACC CCCCAAGTTC GGGTGAAGGC CCAGGGCTCG CAGCCAACGT 

3840 

CGGGGCGGCA GGCCCTGCCA TAGCCTCAGG TTACTCATAT ATACTTTAGA TTGATTTAAA

3900 

ACTTCATTTT TAATTTAAAA GGATCTAGGT GAAGATCCTT TTTGATAATC TCATGACCAA

3960 

AATCCCTTAA CGTGAGTTTT CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG 

4020 

ATCTTCTTGA GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC 

4080 

GCTACCAGCG GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC 

4140 

TGGCTTCAGC AGAGCGCAGA TACCAAATAC TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA 

4200 

CCACTTCAAG AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT 

4260 

GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC 

4320 

GGATAAGGCG CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG

4380 

AACGACCTAC ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC 

4440 

CGAAGGGAGA AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC 

4500 

GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT 

4560 

CTGACTTGAG CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC 

4620 

CAGCAACGCG GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT 

4680 

TCCTGCGTTA TCCCCTGATT CTGTGGATAA CCGTATTACC GCCATGCAT 

4729 

// 
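
The listing above prints a running base count after each row of six ten-base groups, so a scanned copy can be checked mechanically for dropped or damaged rows. The following is a minimal sketch of such a check, not part of the original disclosure; the function names `parse_listing` and `mismatches` are hypothetical, and the sketch assumes only that bases appear as whitespace-separated groups and that counters appear as bare integers.

```python
import re

# Tokens made only of these letters are treated as sequence; N marks an unread base.
BASES = re.compile(r"[ACGTNacgtn]+")


def parse_listing(lines):
    """Collect the bases and the printed position counters from listing lines.

    Returns (sequence, checkpoints), where each checkpoint is a pair
    (printed_position, bases_counted_so_far).
    """
    sequence = []
    seen = 0
    checkpoints = []
    for line in lines:
        for token in line.split():
            if token.isdigit():
                # A bare integer is taken to be a running position counter.
                checkpoints.append((int(token), seen))
            elif BASES.fullmatch(token):
                sequence.append(token.upper())
                seen += len(token)
            # Any other token is treated as scan noise and skipped.
    return "".join(sequence), checkpoints


def mismatches(checkpoints):
    """Return the checkpoints whose printed position differs from the count."""
    return [(printed, counted) for printed, counted in checkpoints
            if printed != counted]


# First two rows of the 4729 bp listing, as printed above.
sample = [
    "TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG",
    "",
    "60",
    "",
    "CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC CCCGCCCATT",
    "",
    "120",
]
sequence, checkpoints = parse_listing(sample)
print(len(sequence), mismatches(checkpoints))
```

A row in which a group was damaged in scanning, or from which a counter was dropped, shows up as a mismatch at the next printed counter, which narrows the search to a single row of sixty bases.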
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/7a. & 



SQ SEQUENCE 5619 BP

GATCCCCATG AACCGTGCTT TTAGCAGGAA GAAAGACAAA ACATGGATGC ATACACCTGA

60

AGCTTTATCA AAACATTTCA TTCCCTATAA TGCAAAGTTT CTTGGCAGTA CAGAAGTGGA

120

ACAGCCAAAA GGAACAGAAG TTGTGAGAGA TGCTGTAAGG AAACTAAAGT TTGCAAGACA

180

TATCAAGAAA TCTGAAGGCC AGAAAATTCC TAAAGTGGAG TTGCAAATAT CAATTTATGG

240

AGTAAAAATT CTAGAACCCA AAACAAAGGA AGTTCAACAC AATTGCCAGC TTCATAGAAT

300

ATCTTTTTGT GCAGATGATA AAACTGACAA GAGGATATTC ACTTTCATAT GCAAAGATTC

360

TGAGTCAAAT AAACATTTGT GCTATGTATT TGACAGCGAA AAGTGTGCTG AAGAGATCAC

420

TTTAACAATT GGCCAAGCAT TTGACCTGGC ATACACGAAA TTTCTAGAAT CAGGAGGAAA

480

AGATGTTGAA ACAAGAAAAC AGATCGCAGG GTTACAAAAA AGAATCCAAG ACTTAGAAAC

540

AGAAAATATG GAACTTAAAA ATAAAGTACA AGATTTGGAA AACCAACTGA GAATAACTCA

600

AGTATCAGCA CCTCCAGCAG GCAGTATGAC ACCTAAGTCG CCCTCCACTG ACATCTTTGA

660

TATGATTCCA TTTTCTCCAA TATCACACCA GTCTTCGATG CCTACTCGCA ATGGCACACA

720

GCCACCTCCA GTACCTAGTA GATCTACTGA GATTAAACGG GACCTGTTTG GAGCAGAACC

780

TTTTGACCCA TTTAACTGTG GAGCAGCAGA TTTCCCTCCA GATATTCAAT CAAAATTAGA

840

TGAGATGCAG GAGGGGTTCA AAATGGGACT AACTCTTGAA GGCACAGTAT TTTGTCTCGA

900

CCCGTTAGAC AGTAGGTGCG TCGACGGTAC CGCGGGCCCG GGATCCATCG CCACCATGGT

960

GAGCAAGGGC GAGGAGCTGT TCACCGGGGT GGTGCCCATC CTGGTCGAGC TGGACGGCGA

1020

CGTAAACGGC CACAAGTTCA GCGTGTCCGG CGAGGGCGAG GGCGATGCCA CCTACGGCAA

1080

GCTGACCCTG AAGTTCATCT GCACCACCGG CAAGCTGCCC GTGCCCTGGC CCACCCTCGT

1140

GACCACCCTG ACCTACGGCG TGCAGTGCTT CAGCCGCTAC CCCGACCACA TGAAGCAGCA

1200

CGACTTCTTC AAGTCCGCCA TGCCCGAAGG CTACGTCCAG GAGCGCACCA TCTTCTTCAA

1260

GGACGACGGC AACTACAAGA CCCGCGCCGA GGTGAAGTTC GAGGGCGACA CCCTGGTGAA

1320

CCGCATCGAG CTGAAGGGCA TCGACTTCAA GGAGGACGGC AACATCCTGG GGCACAAGCT

1380

GGAGTACAAC TACAACAGCC ACAACGTCTA TATCATGGCC GACAAGCAGA AGAACGGCAT

1440

CAAGGTGAAC TTCAAGATCC GCCACAACAT CGAGGACGGC AGCGTGCAGC TCGCCGACCA

1500

CTACCAGCAG AACACCCCCA TCGGCGACGG CCCCGTGCTG CTGCCCGACA ACCACTACCT

1560 

GAGCACCCAG TCCGCCCTGA GCAAAGACCC CAACGAGAAG CGCGATCACA TGGTCCTGCT 

1620 

GGAGTTCGTG ACCGCCGCCG GGATCACTCT CGGCATGGAC GAGCTGTACA AGTAAAGCGG 

1680 

CCGCGACTCT AGATCATAAT CAGCCATACC ACATTTGTAG AGGTTTTACT TGCTTTAAAA 
1740

AACCTCCCAC ACCTCCCCCT GAACCTGAAA CATAAAATGA ATGCAATTGT TGTTGTTAAC
1800

TTGTTTATTG CAGCTTATAA TGGTTACAAA TAAAGCAATA GCATCACAAA TTTCACAAAT

1860 

AAAGCATTTT TTTCACTGCA TTCTAGTTGT GGTTTGTCCA AACTCATCAA TGTATCTTAA 

1920

GGCGTAAATT GTAAGCGTTA ATATTTTGTT AAAATTCGCG TTAAATTTTT GTTAAATCAG

1980 

CTCATTTTTT AACCAATAGG CCGAAATCGG CAAAATCCCT TATAAATCAA AAGAATAGAC 

2040 

CGAGATAGGG TTGAGTGTTG TTCCAGTTTG GAACAAGAGT CCACTATTAA AGAACGTGGA
2100

CTCCAACGTC AAAGGGCGAA AAACCGTCTA TCAGGGCGAT GGCCCACTAC GTGAACCATC
2160

ACCCTAATCA AGTTTTTTGG GGTCGAGGTG CCGTAAAGCA CTAAATCGGA ACCCTAAAGG

2220 

GAGCCCCCGA TTTAGAGCTT GACGGGGAAA GCCGGCGAAC GTGGCGAGAA AGGAAGGGAA 

2280 

GAAAGCGAAA GGAGCGGGCG CTAGGGCGCT GGCAAGTGTA GCGGTCACGC TGCGCGTAAC

2340

CACCACACCC GCCGCGCTTA ATGCGCCGCT ACAGGGCGCG TCAGGTGGCA CTTTTCGGGG
2400

AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT
2460

CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTCCTGAGGC
2520 

GGAAAGAACC AGCTGTGGAA TGTGTGTCAG TTAGGGTGTG GAAAGTCCCC AGGCTCCCCA 

2580

GCAGGCAGAA GTATGCAAAG CATGCATCTC AATTAGTCAG CAACCAGGTG TGGAAAGTCC
2640

CCAGGCTCCC CAGCAGGCAG AAGTATGCAA AGCATGCATC TCAATTAGTC AGCAACCATA
2700

GTCCCGCCCC TAACTCCGCC CATCCCGCCG CTAACTCCGC CCAGTTCCGC CCATTCTCCG

2760

CCCCATGGCT GACTAATTTT TTTTATTTAT GCAGAGGCCG AGGCCGCCTC GGCCTCTGAG

2820

CTATTCCAGA AGTAGTGAGG AGGCTTTTTT GGAGGCCTAG GCTTTTGCAA AGATGGATCA 

2880 

AGAGACAGGA TGAGGATCGT TTCGCATGAT TGAACAAGAT GGATTGCACG CAGGTTCTCC 

2940 

GGCCGCTTGG GTGGAGAGGC TATTCGGCTA TGACTGGGCA CAACAGACAA TCGGCTGCTC 

3000 

TGATGCCGCC GTGTTCCGGC TGTCAGCGCA GGGGCGCCCG GTTCTTTTTG TCAAGACCGA
3060 

CCTGTCCGGT GCCCTGAATG AACTGCAAGA CGAGGCAGCG CGGCTATCGT GGCTGGCCAC 

3120 

GACGGGCGTT CCTTGCGCAG CTGTGCTCGA CGTTGTCACT GAAGCGGGAA GGGACTGGCT 

3180 
GCTATTGGGC GAAGTGCCGG GGCAGGATCT CCTGTCATCT CACCTTGCTC CTGCCGAGAA

3240 

AGTATCCATC ATGGCTGATG CAATGCGGCG GCTGCATACG CTTGATCCGG CTACCTGCCC 

3300 

ATTCGACCAC CAAGCGAAAC ATCGCATCGA GCGAGCACGT ACTCGGATGG AAGCCGGTCT 

3360 

TGTCGATCAG GATGATCTGG ACGAAGAGCA TCAGGGGCTC GCGCCAGCCG AACTGTTCGC 

3420 

CAGGCTCAAG GCGAGCATGC CCGACGGCGA GGATCTCGTC GTGACCCATG GCGATGCCTG 

3480 

CTTGCCGAAT ATCATGGTGG AAAATGGCCG CTTTTCTGGA TTCATCGACT GTGGCCGGCT 

3540 

GGGTGTGGCG GACCGCTATC AGGACATAGC GTTGGCTACC CGTGATATTG CTGAAGAGCT 

3600 

TGGCGGCGAA TGGGCTGACC GCTTCCTCGT GCTTTACGGT ATCGCCGCTC CCGATTCGCA 

3660 

GCGCATCGCC TTCTATCGCC TTCTTGACGA GTTCTTCTGA GCGGGACTCT GGGGTTCGAA 

3720 

ATGACCGACC AAGCGACGCC CAACCTGCCA TCACGAGATT TCGATTCCAC CGCCGCCTTC 

3780 

TATGAAAGGT TGGGCTTCGG AATCGTTTTC CGGGACGCCG GCTGGATGAT CCTCCAGCGC 

3840 

GGGGATCTCA TGCTGGAGTT CTTCGCCCAC CCTAGGGGGA GGCTAACTGA AACACGGAAG 

3900 

GAGACAATAC CGGAAGGAAC CCGCGCTATG ACGGCAATAA AAAGACAGAA TAAAACGCAC 

3960 

GGTGTTGGGT CGTTTGTTCA TAAACGCGGG GTTCGGTCCC AGGGCTGGCA CTCTGTCGAT 

4020 

ACCCCACCGA GACCCCATTG GGGCCAATAC GCCCGCGTTT CTTCCTTTTC CCCACCCCAC 

4080 

CCCCCAAGTT CGGGTGAAGG CCCAGGGCTC GCAGCCAACG TCGGGGCGGC AGGCCCTGCC 

4140 

ATAGCCTCAG GTTACTCATA TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA

4200

AGGATCTAGG TGAAGATCCT TTTTGATAAT CTCATGACCA AAATCCCTTA ACGTGAGTTT

4260 

TCGTTCCACT GAGCGTCAGA CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT 

4320 

TTTCTGCGCG TAATCTGCTG CTTGCAAACA AAAAAACCAC CGCTACCAGC GGTGGTTTGT 

4380 

TTGCCGGATC AAGAGCTACC AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG 

4440 

ATACCAAATA CTGTCCTTCT AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA 

4500 

GCACCGCCTA CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT 

4560 

AAGTCGTGTC TTACCGGGTT GGACTCAAGA CGATAGTTAC CGGATAAGGC GCAGCGGTCG

4620 

GGCTGAACGG GGGGTTCGTG CACACAGCCC AGCTTGGAGC GAACGACCTA CACCGAACTG 

4680 

AGATACCTAC AGCGTGAGCT ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC 

4740 

AGGTATCCGG TAAGCGGCAG GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA 

4800 

AACGCCTGGT ATCTTTATAG TCCTGTCGGG TTTCGCCACC TCTGACTTGA GCGTCGATTT 

4860 
TTGTGATGCT CGTCAGGGGG GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA

4920 

CGGTTCCTGG CCTTTTGCTG GCCTTTTGCT CACATGTTCT TTCCTGCGTT ATCCCCTGAT 

4980

TCTGTGGATA ACCGTATTAC CGCCATGCAT TAGTTATTAA TAGTAATCAA TTACGGGGTC
5040

ATTAGTTCAT AGCCCATATA TGGAGTTCGG CGTTACATAA CTTACGGTAA ATGGCCCGCC

5100

TGGCTGACCG CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT

5160

AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA

5220

CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG
5280

TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA
5340

GTACATCTAC GTATTAGTCA TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA 

5400 

TGGGCGTGGA TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA
5460

TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 

5520 

CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTGG 

5580

TTTAGTGAAC CGTCAGATCC GCTAGCGCTA CCGGACTCA

5619 

// 
ID pcDNA3.1/His/LacZ circular DNA; 8578 BP

SQ SEQUENCE 8578 BP;

GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 60

CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 120

CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 180

TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 240

GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 300

TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 360

CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 420

ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 480

ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 540

ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 600

TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 660

ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 720

AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG 780

GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA 840

CTGCTTACTG GCTTATCGAA ATTAATACGA CTGACTATAG GGAGACCCAA GCTGGCTAGC 900

GTTTAAACTT AAGCTTACCA TGGGGGGTTC TCATCATCAT CATCATCATG GTATGGCTAG 960

CATGACTGGT GGACAGCAAA TGGGTCGGGA TCTGTACGAC GATGACGATA AGGTACCTAA 1020

GGATCAGCTT GGAGTTGATC CCGTCGTTTT ACAACGTCGT GACTGGGAAA ACCCTGGCGT 1080

TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA ATAGCGAAGA 1140 

GGCCCGCACC GATCGCCCTT CCCAACAGTT GCGCAGCCTG AATGGCGAAT GGCGCTTTGC 1200 

CTGGTTTCCG GCACCAGAAG CGGTGCCGGA AAGCTGGCTG GAGTGCGATC TTCCTGAGGC 1260 

CGATACTGTC GTCGTCCCCT CAAACTGGCA GATGCACGGT TACGATGCGC CCATCTACAC 1320 

CAACGTAACC TATCCCATTA CGGTCAATCC GCCGTTTGTT CCCACGGAGA ATCCGACGGG 1380 

TTGTTACTCG CTCACATTTA ATGTTGATGA AAGCTGGCTA CAGGAAGGCC AGACGCGAAT 1440

TATTTTTGAT GGCGTTAACT CGGCGTTTCA TCTGTGGTGC AACGGGCGCT GGGTCGGTTA 1500

CGGCCAGGAC AGTCGTTTGC CGTCTGAATT TGACCTGAGC GCATTTTTAC GCGCCGGAGA 1560

AAACCGCCTC GCGGTGATGG TGCTGCGTTG GAGTGACGGC AGTTATCTGG AAGATCAGGA 1620

TATGTGGCGG ATGAGCGGCA TTTTCCGTGA CGTCTCGTTG CTGCATAAAC CGACTACACA 1680

AATCAGCGAT TTCCATGTTG CCACTCGCTT TAATGATGAT TTCAGCCGCG CTGTACTGGA 1740

GGCTGAAGTT CAGATGTGCG GCGAGTTGCG TGACTACCTA CGGGTAACAG TTTCTTTATG 1800

GCAGGGTGAA ACGCAGGTCG CCAGCGGCAC CGCGCCTTTC GGCGGTGAAA TTATCGATGA 1860

GCGTGGTGGT TATGCCGATC GCGTCACACT ACGTCTGAAC GTCGAAAACC CGAAACTGTG 1920 

GAGCGCCGAA ATCCCGAATC TCTATCGTGC GGTGGTTGAA CTGCACACCG CCGACGGCAC 1980 

GCTGATTGAA GCAGAAGCCT GCGATGTCGG TTTCCGCGAG GTGCGGATTG AAAATGGTCT 2040

GCTGCTGCTG AACGGCAAGC CGTTGCTGAT TCGAGGCGTT AACCGTCACG AGCATCATCC 2100

TCTGCATGGT CAGGTCATGG ATGAGCAGAC GATGGTGCAG GATATCCTGC TGATGAAGCA 2160

GAACAACTTT AACGCCGTGC GCTGTTCGCA TTATCCGAAC CATCCGCTGT GGTACACGCT 2220

GTGCGACCGC TACGGCCTGT ATGTGGTGGA TGAAGCCAAT ATTGAAACCC ACGGCATGGT 2280 

GCCAATGAAT CGTCTGACCG ATGATCCGCG CTGGCTACCG GCGATGAGCG AACGCGTAAC 2340

GCGAATGGTG CAGCGCGATC GTAATCACCC GAGTGTGATC ATCTGGTCGC TGGGGAATGA 2400

ATCAGGCCAC GGCGCTAATC ACGACGCGCT GTATCGCTGG ATCAAATCTG TCGATCCTTC 2460

CCGCCCGGTG CAGTATGAAG GCGGCGGAGC CGACACCACG GCCACCGATA TTATTTGCCC 2520

GATGTACGCG CGCGTGGATG AAGACCAGCC CTTCCCGGCT GTGCCGAAAT GGTCCATCAA 2580 

AAAATGGCTT TCGCTACCTG GAGAGACGCG CCCGCTGATC CTTTGCGAAT ACGCCCACGC 2640 

GATGGGTAAC AGTCTTGGCG GTTTCGCTAA ATACTGGCAG GCGTTTCGTC AGTATCCCCG 2700

TTTACAGGGC GGCTTCGTCT GGGACTGGGT GGATCAGTCG CTGATTAAAT ATGATGAAAA 2760

CGGCAACCCG TGGTCGGCTT ACGGCGGTGA TTTTGGCGAT ACGCCGAACG ATCGCCAGTT 2820

CTGTATGAAC GGTCTGGTCT TTGCCGACCG CACGCCGCAT CCAGCGCTGA CGGAAGCAAA 2880

ACACCAGCAG CAGTTTTTCC AGTTCCGTTT ATCCGGGCAA ACCATCGAAG TGACCAGCGA 2940

ATACCTGTTC CGTCATAGCG ATAACGAGCT CCTGCACTGG ATGGTGGCGC TGGATGGTAA 3000

GCCGCTGGCA AGCGGTGAAG TGCCTCTGGA TGTCGCTCCA CAAGGTAAAC AGTTGATTGA 3060

ACTGCCTGAA CTACCGCAGC CGGAGAGCGC CGGGCAACTC TGGCTCACAG TACGCGTAGT 3120

GCAACCGAAC GCGACCGCAT GGTCAGAAGC CGGGCACATC AGCGCCTGGC AGCAGTGGCG 3180

TCTGGCGGAA AACCTCAGTG TGACGCTCCC CGCCGCGTCC CACGCCATCC CGCATCTGAC 3240

CACCAGCGAA ATGGATTTTT GCATCGAGCT GGGTAATAAG CGTTGGCAAT TTAACCGCCA 3300

GTCAGGCTTT CTTTCACAGA TGTGGATTGG CGATAAAAAA CAACTGCTGA CGCCGCTGCG 3360

CGATCAGTTC ACCCGTGCAC CGCTGGATAA CGACATTGGC GTAAGTGAAG CGACCCGCAT 3420

TGACCCTAAC GCCTGGGTCG AACGCTGGAA GGCGGCGGGC CATTACCAGG CCGAAGCAGC 3480 

GTTGTTGCAG TGCACGGCAG ATACACTTGC TGATGCGGTG CTGATTACGA CCGCTCACGC 3540 

GTGGCAGCAT CAGGGGAAAA CCTTATTTAT CAGCCGGAAA ACCTACCGGA TTGATGGTAG 3600 

TGGTCAAATG GCGATTACCG TTGATGTTGA AGTGGCGAGC GATACACCGC ATCCGGCGCG 3660 

GATTGGCCTG AACTGCCAGC TGGCGCAGGT AGCAGAGCGG GTAAACTGGC TCGGATTAGG 3720

GCCGCAAGAA AACTATCCCG ACCGCCTTAC TGCCGCCTGT TTTGACCGCT GGGATCTGCC 3780 

ATTGTCAGAC ATGTATACCC CGTACGTCTT CCCGAGCGAA AACGGTCTGC GCTGCGGGAC 3840 

GCGCGAATTG AATTATGGCC CACACCAGTG GCGCGGCGAC TTCCAGTTCA ACATCAGCCG 3900 

CTACAGTCAA CAGCAACTGA TGGAAACCAG CCATCGCCAT CTGCTGCACG CGGAAGAAGG 3960 

CACATGGCTG AATATCGACG GTTTCCATAT GGGGATTGGT GGCGACGACT CCTGGAGCCC 4020 

GTCAGTATCG GCGGAGTTCC AGCTGAGCGC CGGTCGCTAC CATTACCAGT TGGTCTGGTG 4080 

TCAAAAATAA TAAAGCCGAA TTCTGCAGAT ATCCAGCACA GTGGCGGCCG CTCGAGTCTA 4140

GAGGGCCCGT TTAAACCCGC TGATCAGCCT CGACTGTGCC TTCTAGTTGC CAGCCATCTG 4200

TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG TGCCACTCCC ACTGTCCTTT 4260

CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG GTGTCATTCT ATTCTGGGGG 4320

GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA CAATAGCAGG CATGCTGGGG 4380

ATGCGGTGGG CTCTATGGCT TCTGAGGCGG AAAGAACCAG CTGGGGCTCT AGGGGGTATC 4440

CCCACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG CGCAGCGTGA 4500

CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC TTTCTTCCCT TCCTTTCTCG 4560

CCACGTTCGC CGGCTTTCCC CGTCAAGCTC TAAATCGGGG CATCCCTTTA GGGTTCCGAT 4620

TTAGTGCTTT ACGGCACCTC GACCCCAAAA AACTTGATTA GGGTGATGGT TCACGTAGTG 4680

GGCCATCGCC CTGATAGACG GTTTTTCGCC CTTTGACGTT GGAGTCCACG TTCTTTAATA 4740

GTGGACTCTT GTTCCAAACT GGAACAACAC TCAACCCTAT CTCGGTCTAT TCTTTTGATT 4800

TATAAGGGAT TTTGGGGATT TCGGCCTATT GGTTAAAAAA TGAGCTGATT TAACAAAAAT 4860

TTAACGCGAA TTAATTCTGT GGAATGTGTG TCAGTTAGGG TGTGGAAAGT CCCCAGGCTC 4920

CCCAGGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA GTCAGCAACC AGGTGTGGAA 4980

AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA TGCAAAGCAT GCATCTCAAT TAGTCAGCAA 5040

CCATAGTCCC GCCCCTAACT CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT 5100 

CTCCGCCCCA TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCTGCCT 5160 

CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG CCTAGGCTTT TGCAAAAAGC 5220 

TCCCGGGAGC TTGTATATCC ATTTTCGGAT CTGATCAAGA GACAGGATGA GGATCGTTTC 5280 

GCATGATTGA ACAAGATGGA TTGCACGCAG GTTCTCCGGC CGCTTGGGTG GAGAGGCTAT 5340

TCGGCTATGA CTGGGCACAA CAGACAATCG GCTGCTCTGA TGCCGCCGTG TTCCGGCTGT 5400

CAGCGCAGGG GCGCCCGGTT CTTTTTGTCA AGACCGACCT GTCCGGTGCC CTGAATGAAC 5460

TGCAGGACGA GGCAGCGCGG CTATCGTGGC TGGCCACGAC GGGCGTTCCT TGCGCAGCTG 5520

TGCTCGACGT TGTCACTGAA GCGGGAAGGG ACTGGCTGCT ATTGGGCGAA GTGCCGGGGC 5580

AGGATCTCCT GTCATCTCAC CTTGCTCCTG CCGAGAAAGT ATCCATCATG GCTGATGCAA 5640

TGCGGCGGCT GCATACGCTT GATCCGGCTA CCTGCCCATT CGACCACCAA GCGAAACATC 5700

GCATCGAGCG AGCACGTACT CGGATGGAAG CCGGTCTTGT CGATCAGGAT GATCTGGACG 5760

AAGAGCATCA GGGGCTCGCG CCAGCGGAAC TGTTCGCCAG GCTCAAGGCG CGCATGCCCG 5820 

ACGGCGAGGA TCTCGTCGTG ACGCATGGCG ATGCCTGCTT GCCGAATATC ATGGTGGAAA 5880 

ATGGCCGCTT TTCTGGATTC ATCGACTGTG GCCGGCTGGG TGTGGCGGAC CGCTATCAGG 5940 

ACATAGCGTT GGCTACCCGT GATATTGCTG AAGAGCTTGG CGGCGAATGG GCTGACCGCT 6000 

TCCTCGTGCT TTACGGTATC GCCGCTCCCG ATTCGCAGCG CATCGCCTTC TATCGCCTTC 6060 

TTGACGAGTT CTTCTGAGCG GGACTCTGGG GTTCGAAATG ACCGACCAAG CGACGCCCAA 6120 

CCTGCCATCA CGAGATTTCG ATTCCACCGC CGCCTTCTAT GAAAGGTTGG GCTTCGGAAT 6180 

CGTTTTCCGG GACGCCGGCT GGATGATCCT CCAGCGCGGG GATCTCATGC TGGAGTTCTT 6240 

CGCCCACCCC AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC 6300

AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT CCAAACTCAT 6360

CAATGTATCT TATCATGTCT GTATACCGTC GACCTCTAGC TAGAGCTTGG CGTAATCATG 6420

GTCATAGCTG TTTCCTGTGT GAAATTGTTA TCCGCTCACA ATTCCACACA ACATACGAGC 6480

CGGAAGCATA AAGTGTAAAG CCTGGGGTGC CTAATGAGTG AGCTAACTCA CATTAATTGC 6540

GTTGCGCTCA CTGCCCGCTT TCCAGTCGGG AAACCTGTCG TGCCAGCTGC ATTAATGAAT 6600

CGGCCAACGC GCGGGGAGAG GCGGTTTGCG TATTGGGCGC TCTTCCGCTT CCTCGCTCAC 6660

TGACTCGCTG CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA TCAGCTCACT CAAAGGCGGT 6720

AATACGGTTA TCCACAGAAT CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA 6780

GCAAAAGGCC AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 6840

CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT 6900

ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG TTCCGACCCT 6960 



GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA AGCGTGGCGC TTTCTCAATG 7020

CTCACGCTGT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA 7080

CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA 7140

CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC 7200

GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG GCTACACTAG 7260

AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG 7320

TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA 7380

GCAGATTACG CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC 7440

TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT TATCAAAAAG 7500

GATCTTCAGC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT AAAGTATATA 7560

TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA TCTCAGCGAT 7620

CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC GTGTAGATAA CTACGATACG 7680

GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG CGAGACCCAC GCTCACCGGC 7740

TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA GTGGTCCTGC 7800

AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG TAAGTAGTTC 7860 

GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA GGCATCGTGG TGTCACGCTC 7920 

GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA TCAAGGCGAG TTACATGATC 7980 

CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA 8040 

GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC TTACTGTCAT 8100 

GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT TCTGAGAATA 8160

GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAATA CGGGATAATA CCGCGCCACA 8220

TAGCAGAACT TTAAAAGTGC TCATCATTGG AAAACGTTCT TCGGGGCGAA AACTCTCAAG 8280

GATCTTACCG CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC 8340

AGCATGTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC 8400

AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC TTTTTCAATA 8460



TTATTGAAGC ATTTATCAGG GTTATTGTCT GATGAGCGGA TACATATTTG AATGTATTTA 8520 
GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC CTGACGTC 8578 
m nrafsrkkdkn vmhtpealskJifipvna kflgsteveqpkgte vvrdavrklkfarh
velqisiygvkilepktkevqhncqlhrisfcaddfo^ 

afdlaytkflesggkdvetrkqiaglqkriqdletenmelknkvqdlenqlritqvsappagsm^ 

difdmipfspishqssmp tmgtqpppvpsrst eikrdlfgaepfdpfncgaadfppdi 

kmgltlegtvfcldpldsrc 
ID pGA1028 circular DNA; 5021 BP
DE hCed-6cds in pBAD HisA
CC http://www.informaxinc.com/
CC pGA1028 in Top 10:
CC VNTAUTHORNAME|Nina Cromheecke|
FT CDS 8..919
FT /vntifkey="4"
FT /label=hsCED-6
FT CDS 4918..4935
FT /vntifkey="4"
FT /label=HIS-tag
FT misc_feature 4975..4998
FT /vntifkey="21"
FT /label=anti\Xpress\epitope
FT misc_feature 4954..4998
FT /vntifkey="21"
FT /label=EK\recognition\site
FT CDS 1473..2333
FT /vntifkey="4"
FT /label=Amp
FT rep_origin 2478..3151
FT /vntifkey="33"
FT /label=ColE1
FT CDS 3682..4560
FT /vntifkey="4"
FT /label=AraC\ORF
SQ SEQUENCE 5021 BP;

GATCOCCATG AACCGTGCTT TTAGCAGGAA GAAAGACAAA ACATGGATGC ATACACCTGA 60 
AGCTTTATCA AAACATTTCA TTCCCTATAA TGCAAAGTTT CTTGGCAGTA CAGAAGTGGA 120 
ACAGCCAAAA GGAACAGAAG TTGTGAGAGA TGCTGTAAGG AAACTAAAGT TTGCAAGACA 180 
TATCAAGAAA TCTGAAGGCC AGAAAATTCC TAAAGTGGAG TTGCAAATAT CAATTTATGG 240 
AGTAAAAATT CTAGAACCCA AAACAAAGGA AGTTCAACAC AATTGCCAGC TTCATAGAAT 300 
ATCTTTTTGT GCAGATGATA AAACTGACAA GAGGATATTC ACTTTCATAT GCAAAGATTC 360 
TGAGT.^AAT- AAACAT7TGT GCTATGTATT TGACAGCGAA AAGTGTGCTG AAGAGATCAC 4 20 

TTTAACAATT GGCCAAGCAT TTGACCTGGC ATACACGAAA TTTCTAGAAT CAGGAGGAAA 4 80 

AGATGTTGM/ACAAGAAAAC AGATCGCAGG GTTACAAAAA AGAATCCAAG ACTTAGAAAC 540 
AGAAMTATG GAACTTAAAA ATAAAGTACA AGATTTGGAA AACCAACTGA GAATAACTCA 600 
AGTATCAGCA CCTCCAGCAG GCAGTATGAC ACCTAAGTCG CCCTCCACTG ACATCTTTGA 660 
TATGATTCCA TTTTCTCCAA TATCACACCA GTCTTCGATG CCTACTCGCA ATGGCACACA 720 
GCCAGGTCCA GTACCTAGTA GATCTACTGA GATTAAACGG GACCTGTTTG GAGCAGAACC . 780 
TTTT^^CA. TTTAAC7GTG GAGCAGCAGA TTTCCCTCCA GATATTCAAT CAAAATTAGA 840 
tSaGATGCAG . GAGGGG7TCA AAATGGGACT AACTCTTGAA GGCACAGTAT TTTGTCTCGA 900 
CCCGTT^AC ^TAGGTGCT GAGTCGACGG TACCATATGG GAATTCGAAG CTTGGCTGTT 960 
TTGGCGGATG AGAGAAGATT TTCAGCCTGA TACAGATTAA ATCAGAACGC AGAAGCGGTC 1020 
TGATAAAACA GAATTTGCCT GGCGGCAGTA GCGCGGTGGT CCCACCTGAC CCCATGCCGA 1080 
ACTCAGAAGT GAAACGCCGT AGCGCCGATG GTAGTGTGGG GTCTCCCCAT GCGAGAGTAG 1140 
GGAACTGCCA GGCATCAAAT AAAACGAAAG GCTCAGTCGA AAGACTGGGC CTTTCGTTTT 1200 
ATCTGTTGTT TGTCGGTGAA CGCTCTCCTG AGTAGGACAA ATCCGCCGGG AGCGGATTTG 1260
AACGTTGCGA AGCAACGGCC CGGAGGGTGG CGGGCAGGAC GCCCGCCATA AACTGCCAGG 1320
CATCAAATTA AGCAGAAGGC CATCCTGACG GATGGCCTTT TTGCGTTTCT ACAAACTCTT 1380
TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT 1440
AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT TCAACATTTC CGTGTCGCCC 1500
TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC TCACCCAGAA ACGCTGGTGA 1560
AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG TTACATCGAA CTGGATCTCA 1620
ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT 1680
TTAAAGTTCT GCTATGTGGC GCGGTATTAT CCCGTGTTGA CGCCGGGCAA GAGCAACTCG 1740
GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA CTCACCAGTC ACAGAAAAGC 1800
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ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC TGCCATAACC ATGAGTGATA 18 60 

ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC GAAGGAGCTA ACCGCTTTTT 1920 

TGCACAACAT GGGGGATCAT GTAACTCGCC TTGATCGTTG GGAACCGGAG CTGAATGAAG 1980

CCATACCAAA CGACGAGCGT GACACCACGA TGCCTGTAGC AATGGCAACA ACGTTGCGCA 2040 

AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA GACTGGATGG 2100 

AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG 2160 

CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG 2220 

ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATG 2280 

AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT TAAGCATTGG TAACTGTCAG 2340 

ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA TTTAAAAGGA 2400 

TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT CCCTTAACGT GAGTTTTCGT 2460

TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT CCTTTTTTTC 2520 

TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC 2580

CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC 2640 

CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC 2700 

CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT GGCGATAAGT 2760

CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG CGGTCGGGCT 2820 

GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT 2880 

ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 2940

ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA GGGGGAAACG 3000 

CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT 3060 

GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC TTTTTACGGT 3120 

TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC CCTGATTCTG 3180 

TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC CGAACGACCG 3240 

AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCT GATGCGGTAT TTTCTCCTTA 3300 

CGCATCTGTG CGGTATTTCA CACCGCATAT GGTGCACTCT CAGTACAATC TGCTCTGATG 3360 

CCGCATAGTT AAGCCAGTAT ACACTCCGCT ATCGCTACGT GACTGGGTCA TGGCTGCGCC 3420 

CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC 3480

TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC 3540

ACCGAAACGC GCGAGGCAGC AGATCAATTC GCGCGCGAAG GCGAAGCGGC ATGCATAATG 3600 

TGCCTGTCAA ATGGACGAAG CAGGGATTCT GCAAACCCTA TGCTACTCCG TCAAGCCGTC 3660 

AATTGTCTGA TTCGTTACCA ATTATGACAA CTTGACGGCT ACATCATTCA CTTTTTCTTC 3720 

ACAACCGGCA CGGAACTCGC TCGGGCTGGC CCCGGTGCAT TTTTTAAATA CCCGCGAGAA 3780 

ATAGAGTTGA TCGTCAAAAC CAACATTGCG ACCGACGGTG GCGATAGGCA TCCGGGTGGT 3840

GCTCAAAAGC AGCTTCGCCT GGCTGATACG TTGGTCCTCG CGCCAGCTTA AGACGCTAAT 3900 

CCCTAACTGC TGGCGGAAAA GATGTGACAG ACGCGACGGC GACAAGCAAA CATGCTGTGC 3960 

GACGCTGGCG ATATCAAAAT TGCTGTCTGC CAGGTGATCG CTGATGTACT GACAAGCCTC 4020 

GCGTACCCGA TTATCCATCG GTGGATGGAG CGACTCGTTA ATCGCTTCCA TGCGCCGCAG 4080 

TAACAATTGC TCAAGCAGAT TTATCGCCAG CAGCTCCGAA TAGCGCCCTT CCCCTTGCCC 4140 

GGCGTTAATG ATTTGCCCAA ACAGGTCGCT GAAATGCGGC TGGTGCGCTT CATCCGGGCG 4200 

AAAGAACCCC GTATTGGCAA ATATTGACGG CCAGTTAAGC CATTCATGCC AGTAGGCGCG 4260 

CGGACGAAAG TAAACCCACT GGTGATACCA TTCGCGAGCC TCCGGATGAC GACCGTAGTG 4320 

ATGAATCTCT CCTGGCGGGA ACAGCAAAAT ATCACCCGGT CGGCAAACAA ATTCTCGTCC 4380

CTGATTTTTC ACCACCCCCT GACCGCGAAT GGTGAGATTG AGAATATAAC CTTTCATTCC 4440

CAGCGGTCGG TCGATAAAAA AATCGAGATA ACCGTTGGCC TCAATCGGCG TTAAACCCGC 4500 

CACCAGATGG GCATTAAACG AGTATCCCGG CAGCAGGGGA TCATTTTGCG CTTCAGCCAT 4560 

ACTTTTCATA CTCCCGCCAT TCAGAGAAGA AACCAATTGT CCATATTGCA TCAGACATTG 4620 

CCGTCACTGC GTCTTTTACT GGCTCTTCTC GCTAACCAAA CCGGTAACCC CGCTTATTAA 4680

AAGCATTCTG TAACAAAGCG GGACCAAAGC CATGACAAAA ACGCGTAACA AAAGTGTCTA 4740 

TAATCACGGC AGAAAAGTCC ACATTGATTA TTTGCACGGC GTGACACTTT GCTATGCCAT 4800

AGCATTTTTA TCCATAAGAT TAGCGGATCC TACCTGACGC TTTTTATCGC AACTCTCTAC 4860

TGTTTCTCCA TACCCGTTTT TTTGGGCTAA CAGGAGGAAT TAACCATGGG GGGTTCTCAT 4920

CATCATCATC ATCATGGTAT GGCTAGCATG ACTGGTGGAC AGCAAATGGG TCGGGATCTG 4980

TACGACGATG ACGATAAGGA TCGATGGGGA TCCGAGCTCG A 5021 
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1 GGTACCGAGC TCTTACGCGT GCTAGCCCGG GCTCGAGATC TGCGATCTGC 
CCATGGCTCG AGAATGCGCA CGATCGGGCC CGAGCTCTAG ACGCTAGACG 

51 ATCTCAATTA GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG
TAGAGTTAAT CAGTCGTTGG TATCAGGGCG GGGATTGAGG CGGGTAGGGC 

101 CCCCTAACTC CGCCCAGTTC CGCCCATTCT CCGCCCCATC GCTGACTAAT
GGGGATTGAG GCGGGTCAAG GCGGGTAAGA GGCGGGGTAG CGACTGATTA 

151 TTTTTTTATT TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC 
AAAAAAATAA ATACGTCTCC GGCTCCGGCG GAGCCGGAGA CTCGATAAGG 

201 AGAAGTAGTG AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTT 
TCTTCATCAC TCCTCCGAAA AAACCTCCGG ATCCGAAAAC GTTTTTCGAA 

251 GGCATTCCGG TACTGTTGGT AAAGCCACCA TGGAAGACGC CAAAAACATA 
CCGTAAGGCC ATGACAACCA TTTCGGTGGT ACCTTCTGCG GTTTTTGTAT 

301 AAGAAAGGCC CGGCGCCATT CTATCCGCTG GAAGATGGAA CCGCTGGAGA
TTCTTTCCGG GCCGCGGTAA GATAGGCGAC CTTCTACCTT GGCGACCTCT

351 GCAACTGCAT AAGGCTATGA AGAGATACGC CCTGGTTCCT GGAACAATTG 
CGTTGACGTA TTCCGATACT TCTCTATGCG GGACCAAGGA CCTTGTTAAC 

401 CTTTTACAGA TGCACATATC GAGGTGGACA TCACTTACGC TGAGTACTTC
GAAAATGTCT ACGTGTATAG CTCCACCTGT AGTGAATGCG ACTCATGAAG 

451 GAAATGTCCG TTCGGTTGGC AGAAGCTATG AAACGATATG GGCTGAATAC
CTTTACAGGC AAGCCAACCG TCTTCGATAC TTTGCTATAC CCGACTTATG 

501 AAATCACAGA ATCGTCGTAT GCAGTGAAAA CTCTCTTCAA TTCTTTATGC
TTTAGTGTCT TAGCAGCATA CGTCACTTTT GAGAGAAGTT AAGAAATACG 

551 CGGTGTTGGG CGCGTTATTT ATCGGAGTTG CAGTTGCGCC CGCGAACGAC 
GCCACAACCC GCGCAATAAA TAGCCTCAAC GTCAACGCGG GCGCTTGCTG 

601 ATTTATAATG AACGTGAATT GCTCAACAGT ATGGGCATTT CGCAGCCTAC 
TAAATATTAC TTGCACTTAA CGAGTTGTCA TACCCGTAAA GCGTCGGATG 

651 CGTGGTGTTC GTTTCCAAAA AGGGGTTGCA AAAAATTTTG AACGTGCAAA 
GCACCACAAG CAAAGGTTTT TCCCCAACGT TTTTTAAAAC TTGCACGTTT 

701 AAAAGCTCCC AATCATCCAA AAAATTATTA TCATGGATTC TAAAACGGAT 
TTTTCGAGGG TTAGTAGGTT TTTTAATAAT AGTACCTAAG ATTTTGCCTA 

751 TACCAGGGAT TTCAGTCGAT GTACACGTTC GTCACATCTC ATCTACCTCC 
ATGGTCCCTA AAGTCAGCTA CATGTGCAAG CAGTGTAGAG TAGATGGAGG 

801 CGGTTTTAAT GAATACGATT TTGTGCCAGA GTCCTTCGAT AGGGACAAGA 
GCCAAAATTA CTTATGCTAA AACACGGTCT CAGGAAGCTA TCCCTGTTCT 

851 CAATTGCACT GATCATGAAC TCCTCTGGAT CTACTGGTCT GCCTAAAGGT
GTTAACGTGA CTAGTACTTG AGGAGACCTA GATGACCAGA CGGATTTCCA 
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901 GTCGCTCTGC CTCATAGAAC TGCCTGCGTG AGATTCTCGC ATGCCAGAGA
CAGCGAGACG GAGTATCTTG ACGGACGCAC TCTAAGAGCG TACGGTCTCT

951 TCCTATTTTT GGCAATCAAA TCATTCCGGA TACTGCGATT TTAAGTGTTG
AGGATAAAAA CCGTTAGTTT AGTAAGGCCT ATGACGCTAA AATTCACAAC

1001 TTCCATTCCA TCACGGTTTT GGAATGTTTA CTACACTCGG ATATTTGATA
AAGGTAAGGT AGTGCCAAAA CCTTACAAAT GATGTGAGCC TATAAACTAT

1051 TGTGGATTTC GAGTCGTCTT AATGTATAGA TTTGAAGAAG AGCTGTTTCT
ACACCTAAAG CTCAGCAGAA TTACATATCT AAACTTCTTC TCGACAAAGA

1101 GAGGAGCCTT CAGGATTACA AGATTCAAAG TGCGCTGCTG GTGCCAACCC
CTCCTCGGAA GTCCTAATGT TCTAAGTTTC ACGCGACGAC CACGGTTGGG

1151 TATTCTCCTT CTTCGCCAAA AGCACTCTGA TTGACAAATA CGATTTATCT
ATAAGAGGAA GAAGCGGTTT TCGTGAGACT AACTGTTTAT GCTAAATAGA

1201 AATTTACACG AAATTGCTTC TGGTGGCGCT CCCCTCTCTA AGGAAGTCGG
TTAAATGTGC TTTAACGAAG ACCACCGCGA GGGGAGAGAT TCCTTCAGCC

1251 GGAAGCGGTT GCCAAGAGGT TCCATCTGCC AGGTATCAGG CAAGGATATG
CCTTCGCCAA CGGTTCTCCA AGGTAGACGG TCCATAGTCC GTTCCTATAC

1301 GGCTCACTGA GACTACATCA GCTATTCTGA TTACACCCGA GGGGGATGAT
CCGAGTGACT CTGATGTAGT CGATAAGACT AATGTGGGCT CCCCCTACTA

1351 AAACCGGGCG CGGTCGGTAA AGTTGTTCCA TTTTTTGAAG CGAAGGTTGT
TTTGGCCCGC GCCAGCCATT TCAACAAGGT AAAAAACTTC GCTTCCAACA

1401 GGATCTGGAT ACCGGGAAAA CGCTGGGCGT TAATCAAAGA GGCGAACTGT
CCTAGACCTA TGGCCCTTTT GCGACCCGCA ATTAGTTTCT CCGCTTGACA

1451 GTGTGAGAGG TCCTATGATT ATGTCCGGTT ATGTAAACAA TCCGGAAGCG
CACACTCTCC AGGATACTAA TACAGGCCAA TACATTTGTT AGGCCTTCGC

1501 ACCAACGCCT TGATTGACAA GGATGGATGG CTACATTCTG GAGACATAGC
TGGTTGCGGA ACTAACTGTT CCTACCTACC GATGTAAGAC CTCTGTATCG

1551 TTACTGGGAC GAAGACGAAC ACTTCTTCAT CGTTGACCGC CTGAAGTCTC
AATGACCCTG CTTCTGCTTG TGAAGAAGTA GCAACTGGCG GACTTCAGAG

1601 TGATTAAGTA CAAAGGCTAT CAGGTGGCTC CCGCTGAATT GGAATCCATC
ACTAATTCAT GTTTCCGATA GTCCACCGAG GGCGACTTAA CCTTAGGTAG

1651 TTGCTCCAAC ACCCCAACAT CTTCGACGCA GGTGTCGCAG GTCTTCCCGA
AACGAGGTTG TGGGGTTGTA GAAGCTGCGT CCACAGCGTC CAGAAGGGCT

1701 CGATGACGCC GGTGAACTTC CCGCCGCCGT TGTTGTTTTG GAGCACGGAA
GCTACTGCGG CCACTTGAAG GGCGGCGGCA ACAACAAAAC CTCGTGCCTT

1751 AGACGATGAC GGAAAAAGAG ATCGTGGATT ACGTCGCCAG TCAAGTAACA
TCTGCTACTG CCTTTTTCTC TAGCACCTAA TGCAGCGGTC AGTTCATTGT

1801 ACCGCGAAAA AGTTGCGCGG AGGAGTTGTG TTTGTGGACG AAGTACCGAA
TGGCGCTTTT TCAACGCGCC TCCTCAACAC AAACACCTGC TTCATGGCTT
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1851 AGGTCTTACC GGAAAACTCG ACGCAAGAAA AATCAGAGAG ATCCTCATAA 
TCCAGAATGG CCTTTTGAGC TGCGTTCTTT TTAGTCTCTC TAGGAGTATT 

1901 AGGCCAAGAA GGGCGGAAAG ATCGCCGTGT AATTCTAGAG TCGGGGCGGC 
TCCGGTTCTT CCCGCCTTTC TAGCGGCACA TTAAGATCTC AGCCCCGCCG 

1951 CGGCCGCTTC GAGCAGACAT GATAAGATAC ATTGATGAGT TTGGACAAAC 
GCCGGCGAAG CTCGTCTGTA CTATTCTATG TAACTACTCA AACCTGTTTG 

2001 CACAACTAGA ATGCAGTGAA AAAAATGCTT TATTTGTGAA ATTTGTGATG
GTGTTGATCT TACGTCACTT TTTTTACGAA ATAAACACTT TAAACACTAC

2051 CTATTGCTTT ATTTGTAACC ATTATAAGCT GCAATAAACA AGTTAACAAC
GATAACGAAA TAAACATTGG TAATATTCGA CGTTATTTGT TCAATTGTTG 

2101 AACAATTGCA TTCATTTTAT GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT
TTGTTAACGT AAGTAAAATA CAAAGTCCAA GTCCCCCTCC ACACCCTCCA 

2151 TTTTTAAAGC AAGTAAAACC TCTACAAATG TGGTAAAATC GATAAGGATC 
AAAAATTTCG TTCATTTTGG AGATGTTTAC ACCATTTTAG CTATTCCTAG 

2201 TGAACGATGG AGCGGAGAAT GGGCGGAACT GGGCGGAGTT AGGGGCGGGA 
ACTTGCTACC TCGCCTCTTA CCCGCCTTGA CCCGCCTCAA TCCCCGCCCT 

2251 TGGGCGGAGT TAGGGGCGGG ACTATGGTTG CTGACTAATT GAGATGCATG
ACCCGCCTCA ATCCCCGCCC TGATACCAAC GACTGATTAA CTCTACGTAC 

2301 CTTTGCATAC TTCTGCCTGC TGGGGAGCCT GGGGACTTTC CACACCTGGT
GAAACGTATG AAGACGGACG ACCCCTCGGA CCCCTGAAAG GTGTGGACCA 

2351 TGCTGACTAA TTGAGATGCA TGCTTTGCAT ACTTCTGCCT GCTGGGGAGC
ACGACTGATT AACTCTACGT ACGAAACGTA TGAAGACGGA CGACCCCTCG

2401 CTGGGGACTT TCCACACCCT AACTGACACA CATTCCACAG CGGATCCGTC
GACCCCTGAA AGGTGTGGGA TTGACTGTGT GTAAGGTGTC GCCTAGGCAG 

2451 GACCGATGCC CTTGAGAGCC TTCAACCCAG TCAGCTCCTT CCGGTGGGCG 
CTGGCTACGG GAACTCTCGG AAGTTGGGTC AGTCGAGGAA GGCCACCCGC 

2501 CGGGGCATGA CTATCGTCGC CGCACTTATG ACTGTCTTCT TTATCATGCA
GCGCCGTACT GATAGCAGCG GCGTGAATAC TGACAGAAGA AATAGTACGT

2551 ACTCGTAGGA CAGGTGCCGG CAGCGCTCTT CCGCTTCCTC GCTCACTGAC
TGAGCATCCT GTCCACGGCC GTCGCGAGAA GGCGAAGGAG CGAGTGACTG

2601 TCGCTGCGCT CGGTCGTTCG GCTGCGGCGA GCGGTATCAG CTCACTCAAA
AGCGACGCGA GCCAGCAAGC CGACGCCGCT CGCCATAGTC GAGTGAGTTT

2651 GGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA
CCGCCATTAT GCCAATAGGT GTCTTAGTCC CCTATTGCGT CCTTTCTTGT

2701 TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG
ACACTCGTTT TCCGGTCGTT TTCCGGTCCT TGGCATTTTT CCGGCGCAAC

2751 CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG
GACCGCAAAA AGGTATCCGA GGCGGGGGGA CTGCTCGTAG TGTTTTTAGC
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2801 ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG 
TGCGAGTTCA GTCTCCACCG CTTTGGGCTG TCCTGATATT TCTATGGTCC

2851 CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG
GCAAAGGGGG ACCTTCGAGG GAGCACGCGA GAGGACAAGG CTGGGACGGC

2901 CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC 
GAATGGCCTA TGGACAGGCG GAAAGAGGGA AGCCCTTCGC ACCGCGAAAG 

2951 TCAATGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA 
AGTTACGAGT GCGACATCCA TAGAGTCAAG CCACATCCAG CAAGCGAGGT 

3001 AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA 
TCGACCCGAC ACACGTGCTT GGGGGGCAAG TCGGGCTGGC GACGCGGAAT

3051 TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC 
AGGCCATTGA TAGCAGAACT CAGGTTGGGC CATTCTGTGC TGAATAGCGG 

3101 ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG 
TGACCGTCGT CGGTGACCAT TGTCCTAATC GTCTCGCTCC ATACATCCGC 

3151 GTGCTACAGA GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGG
CACGATGTCT CAAGAACTTC ACCACCGGAT TGATGCCGAT GTGATCTTCC

3201 ACAGTATTTG GTATCTGCGC TCTGCTGAAG CCAGTTACCT TCGGAAAAAG
TGTCATAAAC CATAGACGCG AGACGACTTC GGTCAATGGA AGCCTTTTTC

3251 AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT AGCGGTGGTT 
TCAACCATCG AGAACTAGGC CGTTTGTTTG GTGGCGACCA TCGCCACCAA 

3301 TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA 
AAAAACAAAC GTTCGTCGTC TAATGCGCGT CTTTTTTTCC TAGAGTTCTT 

3351 GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC
CTAGGAAACT AGAAAAGATG CCCCAGACTG CGAGTCACCT TGCTTTTGAG 

3401 ACGTTAAGGG ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA 
TGCAATTCCC TAAAACCAGT ACTCTAATAG TTTTTCCTAG AAGTGGATCT 

3451 TCCTTTTAAA TTAAAAATGA AGTTTTAAAT CAATCTAAAG TATATATGAG
AGGAAAATTT AATTTTTACT TCAAAATTTA GTTAGATTTC ATATATACTC

3501 TAAACTTGGT CTGACAGTTA CCAATGCTTA ATCAGTGAGG CACCTATCTC
ATTTGAACCA GACTGTCAAT GGTTACGAAT TAGTCACTCC GTGGATAGAG

3551 AGCGATCTGT CTATTTCGTT CATCCATAGT TGCCTGACTC CCCGTCGTGT 
TCGCTAGACA GATAAAGCAA GTAGGTATCA ACGGACTGAG GGGCAGCAGA 

3601 AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG TGCTGCAATG 
TCTATTGATG CTATGCCCTC CCGAATGGTA GACCGGGGTC ACGACGTTAC 

3651 ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA 
TATGGCGCTC TGGGTGCGAG TGGCCGAGGT CTAAATAGTC GTTATTTGGT 

3701 GCCAGCCGGA AGGGCCGAGC GCAGAAGTGG TCCTGCAACT TTATCCGCCT 
CGGTCGGCCT TCCCGGCTCG CGTCTTCACC AGGACGTTGA AATAGGCGGA 
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3751 CCATCCAGTC TATTAATTGT TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA 
GGTAGGTCAG ATAATTAACA ACGGCCCTTC GATCTCATTC ATCAAGCGGT 

3801 GTTAATAGTT TGCGCAACGT TGTTGCCATT GCTACAGGCA TCGTGGTGTC 
CAATTATCAA ACGCGTTGCA ACAACGGTAA CGATGTCCGT AGCACCACAG 

3851 ACGCTCGTCG TTTGGTATGG CTTCATTCAG CTCCGGTTCC CAACGATCAA
TGCGAGCAGC AAACCATACC GAAGTAAGTC GAGGCCAAGG GTTGCTAGTT 

3901 GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC 
CCGCTCAATG TACTAGGGGG TACAACACGT TTTTTCGCCA ATCGAGGAAG 

3951 GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT 
CCAGGAGGCT AGCAACAGTC TTCATTCAAC CGGCGTCACA ATAGTGAGTA

4001 GGTTATGGCA GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT 
CCAATACCGT CGTGACGTAT TAAGAGAATG ACAGTACGGT AGGCATTCTA 

4051 GCTTTTCTGT GACTGGTGAG TACTCAACCA AGTCATTCTG AGAATAGTGT 
CGAAAAGACA CTGACCACTC ATGAGTTGGT TCAGTAAGAC TCTTATCACA 

4101 ATGCGGCGAC CGAGTTGCTC TTGCCCGGCG TCAATACGGG ATAATACCGC 
TACGCCGCTG GCTCAACGAG AACGGGCCGC AGTTATGCCC TATTATGGCG 

4151 GCCACATAGC AGAACTTTAA AAGTGCTCAT CATTGGAAAA CGTTCTTCGG 
CGGTGTATCG TCTTGAAATT TTCACGAGTA GTAACCTTTT GCAAGAAGCC 

4201 GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG TTCGATGTAA
CCGCTTTTGA GAGTTCCTAG AATGGCGACA ACTCTAGGTC AAGCTACATT

4251 CCCACTCGTG CACCCAACTG ATCTTCAGCA TCTTTTACTT TCACCAGCGT
GGGTGAGCAC GTGGGTTGAC TAGAAGTCGT AGAAAATGAA AGTGGTCGCA

4301 TTCTGGGTGA GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA
AAGACCCACT CGTTTTTGTC CTTCCGTTTT ACGGCGTTTT TTCCCTTATT 

4351 GGGCGACACG GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT
CCCGCTGTGC CTTTACAACT TATGAGTATG AGAAGGAAAA AGTTATAATA 

4401 TGAAGCATTT ATCAGGGTTA TTGTCTCATG AGCGGATACA TATTTGAATG
ACTTCGTAAA TAGTCCCAAT AACAGAGTAC TCGCCTATGT ATAAACTTAC

4451 TATTTAGAAA AATAAACAAA TAGGGGTTCC GCGCACATTT CCCCGAAAAG
ATAAATCTTT TTATTTGTTT ATCCCCAAGG CGCGTGTAAA GGGGCTTTTC

4501 TGCCACCTGA CGCGCCCTGT AGCGGCGCAT TAAGCGCGGC GGGTGTGGTG
ACGGTGGACT GCGCGGGACA TCGCCGCGTA ATTCGCGCCG CCCACACCAC 

4551 GTTACGCGCA GCGTGACCGC TACACTTGCC AGCGCCCTAG CGCCCGCTCC 
CAATGCGCGT CGCACTGGCG ATGTGAACGG TCGCGGGATC GCGGGCGAGG 

4601 TTTCGCTTTC TTCCCTTCCT TTCTCGCCAC GTTCGCCGGC TTTCCCCGTC
AAAGCGAAAG AAGGGAAGGA AAGAGCGGTG CAAGCGGCCG AAAGGGGCAG

4651 AAGCTCTAAA TCGGGGGCTC CCTTTAGGGT TCCGATTTAG TGCTTTACGG
TTCGAGATTT AGCCCCCGAG GGAAATCCCA AGGCTAAATC ACGAAATGCC
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4701 CACCTCGACC CCAAAAAACT TGATTAGGGT GATGGTTCAC GTAGTGGGCC 
GTGGAGCTGG GGTTTTTTGA ACTAATCCCA CTACCAAGTG CATCACCCGG 

4751 ATCGCCCTGA TAGACGGTTT TTCGCCCTTT GACGTTGGAG TCCACGTTCT 
TAGCGGGACT ATCTGCCAAA AAGCGGGAAA CTGCAACCTC AGGTGCAAGA

4801 TTAATAGTGG ACTCTTGTTC CAAACTGGAA CAACACTCAA CCCTATCTCG 
AATTATCACC TGAGAACAAG GTTTGACCTT GTTGTGAGTT GGGATAGAGC 

4851 GTCTATTCTT TTGATTTATA AGGGATTTTG CCGATTTCGG CCTATTGGTT 
CAGATAAGAA AACTAAATAT TCCCTAAAAC GGCTAAAGCC GGATAACCAA

4901 AAAAAATGAG CTGATTTAAC AAAAATTTAA CGCGAATTTT AACAAAATAT
TTTTTTACTC GACTAAATTG TTTTTAAATT GCGCTTAAAA TTGTTTTATA

4951 TAACGTTTAC AATTTCCCAT TCGCCATTCA GGCTGCGCAA CTGTTGGGAA
ATTGCAAATG TTAAAGGGTA AGCGGTAAGT CCGACGCGTT GACAACCCTT 

5001 GGGCGATCGG TGCGGGCCTC TTCGCTATTA CGCCAGCCCA AGCTACCATG 
CCCGCTAGCC ACGCCCGGAG AAGCGATAAT GCGGTCGGGT TCGATGGTAC 


5051 ATAAGTAAGT AATATTAAGG TACGGGAGGT ACTTGGAGCG GCCGCAATAA 
TATTCATTCA TTATAATTCC ATGCCCTCCA TGAACCTCGC CGGCGTTATT 

5101 AATATCTTTA TTTTCATTAC ATCTGTGTGT TGGTTTTTTG TGTGAATCGA 
TTATAGAAAT AAAAGTAATG TAGACACACA ACCAAAAAAC ACACTTAGCT 

5151 TAGTACTAAC ATACGCTCTC CATCAAAACA AAACGAAACA AAACAAACTA 
ATCATGATTG TATGCGAGAG GTAGTTTTGT TTTGCTTTGT TTTGTTTGAT 

5201 -.^-^^^iTAGG CTGTCCCCAG TGCAAGTGCA GGTGCCAGAA CATTTCTCTA
G^fHfATCC "GACAGGGGTC ACGTTCACGT CCACGGTCTT GTAAAGAGAT 

5251 TCGATA 
AGCTAT
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SQ SEQUENCE . 4735 BP 

TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 

60 

CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC CCCGCCCATT 

120 

GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC ATTGACGTCA 

180 

ATGGGTGGAG TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC 

240 

AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 

300 

CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 

360 

CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG ACTCACGGGG

420 

ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC AAAATCAACG 

480 

GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT 

540 

ACGGTGGGAG GTCTATATAA GCAGAGCTGG TTTAGTGAAC CGTCAGATCC GCTAGCGCTA 

600 

CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG 

660 

GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC 

720 

GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG 
780

CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC 

840 

GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG 

900 

CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG 

960 

GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC

1020 

ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC 

1080 

AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC 

1140 

GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG 

1200 

CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC 

1260 

GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG

1320 

CTGTACAAGT CCGGCCGGAC TCAGATCTCG AGCTCAAGCT TCGAATTCTG CAGTCGACGG 

1380 

TACCGCGGGC CCGGGATCCA CCGGATCTAG ATAACTGATC ATAATCAGCC ATACCACATT 

1440 

TGTAGAGGTT TTACTTGCTT TAAAAAACCT CCCACACCTC CCCCTGAACC TGAAACATAA 

1500 

AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG 

1560 
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CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT 

1620 

GTCCAAACTC ATCAATGTAT CTTAACGCGT AAATTGTAAG CGTTAATATT TTGTTAAAAT 

1680 

TCGCGTTAAA TTTTTGTTAA ATCAGCTCAT TTTTTAACCA ATAGGCCGAA ATCGGCAAAA 

1740 

TCCCTTATAA ATCAAAAGAA TAGACCGAGA TAGGGTTGAG TGTTGTTCCA GTTTGGAACA 
1800

AGAGTCCACT ATTAAAGAAC GTGGACTCCA ACGTCAAAGG GCGAAAAACC GTCTATCAGG
1860

GCGATGGCCC ACTACGTGAA CCATCACCCT AATCAAGTTT TTTGGGGTCG AGGTGCCGTA 

1920 

AAGCACTAAA TCGGAACCCT AAAGGGAGCC CCCGATTTAG AGCTTGACGG GGAAAGCCGG 

1980 

CGAACGTGGC GAGAAAGGAA GGGAAGAAAG CGAAAGGAGC GGGCGCTAGG GCGCTGGCAA 
2040

GTGTAGCGGT CACGCTGCGC GTAACCACCA CACCCGCCGC GCTTAATGCG CCGCTACAGG

2100

GCGCGTCAGG TGGCACTTTT CGGGGAAATG TGCGCGGAAC CCCTATTTGT TTATTTTTCT 

2160 

AAATACATTC AAATATGTAT CCGCTCATGA GACAATAACC CTGATAAATG CTTCAATAAT 

2220 

ATTGAAAAAG GAAGAGTCCT GAGGCGGAAA GAACCAGCTG TGGAATGTGT GTCAGTTAGG 

2280 

GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 
2340

GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA TGCAAAGCAT

2400

GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT CCGCCCATCC CGCCCCTAAC

2460

TCCGCCCAGT TCCGCCCATT CTCCGCCCCA TGGCTGACTA ATTTTTTTTA TTTATGCAGA
2520

GGCCGAGGCC GCCTCGGCCT CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG
2580

CCTAGGCTTT TGCAAAGATC GATCAAGAGA CAGGATGAGG ATCGTTTCGC ATGATTGAAC
2640

AAGATGGATT GCACGCAGGT TCTCCGGCCG CTTGGGTGGA GAGGCTATTC GGCTATGACT 

2700 

GGGCACAACA GACAATCGGC TGCTCTGATG CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC
2760

GCCCGGTTCT TTTTGTCAAG ACCGACCTGT CCGGTGCCCT GAATGAACTG CAAGACGAGG
2820

CAGCGCGGCT ATCGTGGCTG GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG
2880 

TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT GCCGGGGCAG GATCTCCTGT 

2940 

CATCTCACCT TGCTCCTGCC GAGAAAGTAT CCATCATGGC TGATGCAATG CGGCGGCTGC 
3000

ATACGCTTGA TCCGGCTACC TGCCCATTCG ACCACCAAGC GAAACATCGC ATCGAGCGAG 

3060 

CACGTACTCG GATGGAAGCC GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG 

3120 

GGCTCGCGCC AGCCGAACTG TTCGCCAGGC TCAAGGCGAG CATGCCCGAC GGCGAGGATC 

3180 

TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT GGTGGAAAAT GGCCGCTTTT

3240 
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CTGGATTCAT CGACTGTGGC CGGCTGGGTG TGGCGGACCG CTATCAGGAC ATAGCGTTGG 

3300 

CTACCCGTGA TATTGCTGAA GAGCTTGGCG GCGAATGGGC TGACCGCTTC CTCGTGCTTT

3360 

ACGGTATCGC CGCTCCCGAT TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT 

3420 

TCTGAGCGGG ACTCTGGGGT TCGAAATGAC CGACCAAGCG ACGCCCAACC TGCCATCACG 

3480 

AGATTTCGAT TCCACCGCCG CCTTCTATGA AAGGTTGGGC TTCGGAATCG TTTTCCGGGA

3540 

CGCCGGCTGG ATGATCCTCC AGCGCGGGGA TCTCATGCTG GAGTTCTTCG CCCACCCTAG 

3600 

GGGGAGGCTA ACTGAAACAC GGAAGGAGAC AATACCGGAA GGAACCCGCG CTATGACGGC 

3660 

AATAAAAAGA CAGAATAAAA CGCACGGTGT TGGGTCGTTT GTTCATAAAC GCGGGGTTCG 

3720 

GTCCCAGGGC TGGCACTCTG TCGATACCCC ACCGAGACCC CATTGGGGCC AATACGCCCG 

3780 

CGTTTCTTCC TTTTCCCCAC CCCACCCCCC AAGTTCGGGT GAAGGCCCAG GGCTCGCAGC 

3840 

CAACGTCGGG GCGGCAGGCC CTGCCATAGC CTCAGGTTAC TCATATATAC TTTAGATTGA 

3900 

TTTAAAACTT CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT 

3960 

GACCAAAATC CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT 

4020 

CAAAGGATCT TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA 

4080 

ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA 

4140 

GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT AGCCGTAGTT 

4200 

AGGCCACCAC TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT 

4260 

ACCAGTGGCT GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA 

4320 

GTTACCGGAT AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT 

4380 

GGAGCGAACG ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC 

4440 

GCTTCCCGAA GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTGG GAACAGGAGA 

4500 

GCGCACGAGG GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG 

4560 

CCACCTCTGA CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA 

4620 

AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT 

4680 

GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCA TGCAT 

4735 
// 
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SQ SEQUENCE 5628 BP 

TCGACGGTAC CGCGGGCCCG GGATCCACCG GATCTAGATA ACTGATCATA ATCAGCCATA 

60 

CCACATTTGT AGAGGTTTTA CTTGCTTTAA AAAACCTCCC ACACCTCCCC CTGAACCTGA 

120 

AACATAAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT TGCAGCTTAT AATGGTTACA 

180 

AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT 

240 

GTGGTTTGTC CAAACTCATC AATGTATCTT AACGCGTAAA TTGTAAGCGT TAATATTTTG 

300 

TTAAAATTCG CGTTAAATTT TTGTTAAATC AGCTCATTTT TTAACCAATA GGCCGAAATC 

360 

GGCAAAATCC CTTATAAATC AAAAGAATAG ACCGAGATAG GGTTGAGTGT TGTTCCAGTT 

420 

TGGAACAAGA GTCCACTATT AAAGAACGTG GACTCCAACG TCAAAGGGCG AAAAACCGTC 

480 

TATCAGGGCG ATGGCCCACT ACGTGAACCA TCACCCTAAT CAAGTTTTTT GGGGTCGAGG

540 

TGCCGTAAAG CACTAAATCG GAACCCTAAA GGGAGCCCCC GATTTAGAGC TTGACGGGGA 

600 

AAGCCGGCGA ACGTGGCGAG AAAGGAAGGG AAGAAAGCGA AAGGAGCGGG CGCTAGGGCG 

660 

CTGGCAAGTG TAGCGGTCAC GCTGCGCGTA ACCACCACAC CCGCCGCGCT TAATGCGCCG 

720 

CTACAGGGCG CGTCAGGTGG CACTTTTCGG GGAAATGTGC GCGGAACCCC TATTTGTTTA 

780 

TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGAGAC AATAACCCTG ATAAATGCTT

840 

CAATAATATT GAAAAAGGAA GAGTCCTGAG GCGGAAAGAA CCAGCTGTGG AATGTGTGTC 

900 

AGTTAGGGTG TGGAAAGTCC CCAGGCTCCC CAGCAGGCAG AAGTATGCAA AGCATGCATC 

960 

TCAATTAGTC AGCAACCAGG TGTGGAAAGT CCCCAGGCTC CCCAGCAGGC AGAAGTATGC

1020 

AAAGCATGCA TCTCAATTAG TCAGCAACCA TAGTCCCGCC CCTAACTCCG CCCATCCCGC 

1080 

CCCTAACTCC GCCCAGTTCC GCCCATTCTC CGCCCCATGG CTGACTAATT TTTTTTATTT
1140

ATGCAGAGGC CGAGGCCGCC TCGGCCTCTG AGCTATTCCA GAAGTAGTGA GGAGGCTTTT

1200 

TTGGAGGCCT AGGCTTTTGC AAAGATCGAT CAAGAGACAG GATGAGGATC GTTTCGCATG 

1260 

ATTGAACAAG ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC 

1320 

TATGACTGGG CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG 

1380 

CAGGGGCGCC CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAA 

1440 

GACGAGGCAG CGCGGCTATC GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC

1500 
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GACGTTGTCA CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT 
1560

CTCCTGTCAT CTCACCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG 

1620 

CGGCTGCATA CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC 

1680 

GAGCGAGCAC GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG 

1740 

CATCAGGGGC TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGAGCAT GCCCGACGGC 

1800 

GAGGATCTCG TCGTGACCCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC 

1860

CGCTTTTCTG GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA
1920

GCGTTGGCTA CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC
1980

GTGCTTTACG GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC 

2040 

GAGTTCTTCT GAGCGGGACT CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC 

2100 

CATCACGAGA TTTCGATTCC ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT

2160

TCCGGGACGC CGGCTGGATG ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC 

2220 

ACCCTAGGGG GAGGCTAACT GAAACACGGA AGGAGACAAT ACCGGAAGGA ACCCGCGCTA 
2280

TGACGGCAAT AAAAAGACAG AATAAAACGC ACGGTGTTGG GTCGTTTGTT CATAAACGCG
2340

GGGTTCGGTC CCAGGGCTGG CACTCTGTCG ATACCCCACC GAGACCCCAT TGGGGCCAAT

2400 

ACGCCCGCGT TTCTTCCTTT TCCCCACCCC ACCCCCCAAG TTCGGGTGAA GGCCCAGGGC 

2460

TCGCAGCCAA CGTCGGGGCG GCAGGCCCTG CCATAGCCTC AGGTTACTCA TATATACTTT 
2520

AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA

2580

ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG
2640

AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA

2700

CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT

2760

TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC
2820

CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA
2880

TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA
2940

GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCAGACAGC 

3000 

CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA 

3060 

GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 

3120 

CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG 

3180 



WO 99/64586 



PCT/EP99/04043 



GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 

3240

TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 

3300 

CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCATGC 

3360 

ATTAGTTATT AATAGTAATC AATTACGGGG TCATTAGTTC ATAGCCCATA TATGGAGTTC 

3420 

CGCGTTACAT AACTTACGGT AAATGGCCCG CCTGGCTGAC CGCCCAACGA CCCCCGCCCA 

3480 

TTGACGTCAA TAATGACGTA TGTTCCCATA GTAACGCCAA TAGGGACTTT CCATTGACGT 

3540 

CAATGGGTGG AGTATTTACG GTAAACTGCC CACTTGGCAG TACATCAAGT GTATCATATG 

3600 

CCAAGTACGC CCCCTATTGA CGTCAATGAC GGTAAATGGC CCGCCTGGCA TTATGCCCAG 

3660 

TACATGACCT TATGGGACTT TCCTACTTGG CAGTACATCT ACGTATTAGT CATCGCTATT 

3720 

ACCATGGTGA TGCGGTTTTG GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG 

3780 

GGATTTCCAA GTCTCCACCC CATTGACGTC AATGGGAGTT TGTTTTGGCA CCAAAATCAA 

3840 

CGGGACTTTC CAAAATGTCG TAACAACTCC GCCCCATTGA CGCAAATGGG CGGTAGGCGT 

3900 

GTACGGTGGG AGGTCTATAT AAGCAGAGCT GGTTTAGTGA ACCGTCAGAT CCGCTAGCGC 

3960 

TACCGGTCGC CACCATGGTG AGCAAGGGCG AGGAGCTGTT CACCGGGGTG GTGCCCATCC 

4020 

TGGTCGAGCT GGACGGCGAC GTAAACGGCC ACAAGTTCAG CGTGTCCGGC GAGGGCGAGG 

4080 

GCGATGCCAC CTACGGCAAG CTGACCCTGA AGTTCATCTG CACCACCGGC AAGCTGCCCG 

4140 

TGCCCTGGCC CACCCTCGTG ACCACCCTGA CCTACGGCGT GCAGTGCTTC AGCCGCTACC 

4200 

CCGACCACAT GAAGCAGCAC GACTTCTTCA AGTCCGCCAT GCCCGAAGGC TACGTCCAGG 

4260 

AGCGCACCAT CTTCTTCAAG GACGACGGCA ACTACAAGAC CCGCGCCGAG GTGAAGTTCG 

4320 

AGGGCGACAC CCTGGTGAAC CGCATCGAGC TGAAGGGCAT CGACTTCAAG GAGGACGGCA 

4380 

ACATCCTGGG GCACAAGCTG GAGTACAACT ACAACAGCCA CAACGTCTAT ATCATGGCCG 

4440 

ACAAGCAGAA GAACGGCATC AAGGTGAACT TCAAGATCCG CCACAACATC GAGGACGGCA 

4500 

GCGTGCAGCT CGCCGACCAC TACCAGCAGA ACACCCCCAT CGGCGACGGC CCCGTGCTGC 

4560 

TGCCCGACAA CCACTACCTG AGCACCCAGT CCGCCCTGAG CAAAGACCCC AACGAGAAGC 
4620

GCGATCACAT GGTCCTGCTG GAGTTCGTGA CCGCCGCCGG GATCACTCTC GGCATGGACG 

4680 

AGCTGTACAA GTCCGGCCGG ACTCAGATCC CCATGAACCG TGCTTTTAGC AGGAAGAAAG 

4740 

ACAAAACATG GATGCATACA CCTGAAGCTT TATCAAAACA TTTCATTCCC TATAATGCAA

4800 

AGTTTCTTGG CAGTACAGAA GTGGAACAGC CAAAAGGAAC AGAAGTTGTG AGAGATGCTG

4860 







TAAGGAAACT AAAGTTTGCA AGACATATCA AGAAATCTGA AGGCCAGAAA ATTCCTAAAG

4920 

TGGAGTTGCA AATATCAATT TATGGAGTAA AAATTCTAGA ACCCAAAACA AAGGAAGTTC 

4980

AACACAATTG CCAGCTTCAT AGAATATCTT TTTGTGCAGA TGATAAAACT GACAAGAGGA
5040

TATTCACTTT CATATGCAAA GATTCTGAGT CAAATAAACA TTTGTGCTAT GTATTTGACA
5100

GCGAAAAGTG TGCTGAAGAG ATCACTTTAA CAATTGGCCA AGCATTTGAC CTGGCATACA
5160

CGAAATTTCT AGAATCAGGA GGAAAAGATG TTGAAACAAG AAAACAGATC GCAGGGTTAC
5220

AAAAAAGAAT CCAAGACTTA GAAACAGAAA ATATGGAACT TAAAAATAAA GTACAAGATT
5280

TGGAAAACCA ACTGAGAATA ACTCAAGTAT CAGCACCTCC AGCAGGCAGT ATGACACCTA

5340

AGTCGCCCTC CACTGACATC TTTGATATGA TTCCATTTTC TCCAATATCA CACCAGTCTT
5400 

CGATGCCTAC TCGCAATGGC ACACAGCCAC CTCCAGTACC TAGTAGATCT ACTGAGATTA 

5460

AACGGGACCT GTTTGGAGCA GAACCTTTTG ACCCATTTAA CTGTGGAGCA GCAGATTTCC 

5520 

CTCCAGATAT TCAATCAAAA TTAGATGAGA TGCAGGAGGG GTTCAAAATG GGACTAACTC 
5580

TTGAAGGCAC AGTATTTTGT CTCGACCCGT TAGACAGTAG GTGCTGAG 

5628 

//
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SQ SEQUENCE 6121 BP 



GATCTATGGG CTGTGACCGG AACTGTGGGC TCATCGCTGG GGCTGTCATT GGTGCTGTCC 

60 

TGGCTGTGTT TGGAGGTATT CTAATGCCAG TTGGAGACCT GCTTATCCAG AAGACAATTA 

120 

AAAAGCAAGT TGTCCTCGAA GAAGGTACAA TTGCTTTTAA AAATTGGGTT AAAACAGGCA 

180 

CAGAAGTTTA CAGACAGTTT TGGATCTTTG ATGTGCAAAA TCCACAGGAA GTGATGATGA 

240 

ACAGCAGCAA CATTCAAGTT AAGCAAAGAG GTCCTTATAC GTACAGAGTT CGTTTTCTAG 
300

CCAAGGAAAA TGTAACCCAG GACGCTGAGG ACAACACAGT CTCTTTCCTG CAGCCCAATG
360

GTGCCATCTT CGAACCTTCA CTATCAGTTG GAACAGAGGC TGACAACTTC ACAGTTCTCA 

420 

ATCTGGCTGT GGCAGCTGCA TCCCATATCT ATCAAAATCA ATTTGTTCAA ATGATCCTCA 

480 

ATTCACTTAT TAACAAGTCA AAATCTTCTA TGTTCCAAGT CAGAACTTTG AGAGAACTGT 

540 

TATGGGGCTA TAGGGATCCA TTTTTGAGTT TGGTTCCGTA CCCTGTTACT ACTACAGTTG 

600 

GTCTGTTTTA TCCTTACAAC AATACTGCAG ATGGAGTTTA TAAAGTTTTC AATGGAAAAG 

660 

ATAACATAAG TAAAGTTGCC ATAATCGACA CATATAAAGG TAAAAGGAAT CTGTCCTATT 

720 

GGGAAAGTCA CTGCGACATG ATTAATGGTA CAGATGCAGC CTCATTTCCA CCTTTTGTTG 
780 

AGAAAAGCCA GGTATTGCAG TTCTTTTCTT CTGATATTTG CAGGTCAATC TATGCTGTAT 
840 

TTGAATCCGA CGTTAATCTG AAAGGAATCC CTGTGTATAG ATTCGTTCTT CCATCCAAGG 

900 

CCTTTGCCTC TCCAGTTGAA AACCCAGACA ACTATTGTTT CTGCACAGAA AAAATTATCT 
960 

CAAAAAATTG TACATCATAT GGTGTGCTAG ACATCAGCAA ATGCAAAGAA GGGAGACCTG 
1020 

TGTACATTTC ACTTCCTCAT TTTCTGTATG CAAGTCCTGA TGTTTCAGAA CCTATTGATG 
1080 

GATTAAACCC AAATGAAGAA GAACATAGGA CATACTTGGA TATTCAACCT ATAACTGGAT 

1140 

TCACTTTACA ATTTGCAAAA CGGCTGCAGG TCAACCTATT GGTCAAGCCA TCAGAAAAAA 
1200 

TTCAAGTATT AAAGAATCTG AAGAGGAACT ATATTGTGCC TATTCTTTGG CTTAATGAGA 

1260 

CTGGGACCAT TGGTGATGAG AAGGCAAACA TGTTCAGAAG TCAAGTAACT GGAAAAATAA 
1320 

ACCTCCTTGG CCTGATAGAA ATGATCTTAC TCAGTGTTGG TGTGGTGATG TTTGTTGCTT 

1380 

TTATGATTTC ATATTGTGCA TGCAGATCGA AAACAATAAA AGTCGACGGT ACCGCGGGCC 

1440 

CGGGATCCAT CGCCACCATG GTGAGCAAGG GCGAGGAGCT GTTCACCGGG GTGGTGCCCA 

1500 

TCCTGGTCGA GCTGGACGGC GACGTAAACG GCCACAAGTT CAGCGTGTCC GGCGAGGGCG 

1560 

AGGGCGATGC CACCTACGGC AAGCTGACCC TGAAGTTCAT CTGCACCACC GGCAAGCTGC 

1620 

CCGTGCCCTG GCCCACCCTC GTGACCACCC TGACCTACGG CGTGCAGTGC TTCAGCCGCT 

1680 

ACCCCGACCA CATGAAGCAG CACGACTTCT TCAAGTCCGC CATGCCCGAA GGCTACGTCC 

1740 

AGGAGCGCAC CATCTTCTTC AAGGACGACG GCAACTACAA GACCCGCGCC GAGGTGAAGT 

1800 

TCGAGGGCGA CACCCTGGTG AACCGCATCG AGCTGAAGGG CATCGACTTC AAGGAGGACG 

1860 

GCAACATCCT GGGGCACAAG CTGGAGTACA ACTACAACAG CCACAACGTC TATATCATGG 

1920 

CCGACAAGCA GAAGAACGGC ATCAAGGTGA ACTTCAAGAT CCGCCACAAC ATCGAGGACG 

1980 

GCAGCGTGCA GCTCGCCGAC CACTACCAGC AGAACACCCC CATCGGCGAC GGCCCCGTGC 

2040 

TGCTGCCCGA CAACCACTAC CTGAGCACCC AGTCCGCCCT GAGCAAAGAC CCCAACGAGA 

2100 

AGCGCGATCA CATGGTCCTG CTGGAGTTCG TGACCGCCGC CGGGATCACT CTCGGCATGG 

2160 

ACGAGCTGTA CAAGTAAAGC GGCCGCGACT CTAGATCATA ATCAGCCATA CCACATTTGT 

2220 

AGAGGTTTTA CTTGCTTTAA AAAACCTCCC ACACCTCCCC CTGAACCTGA AACATAAAAT 

2280 

GAATGCAATT GTTGTTGTTA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA 

2340 

TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC 

2400 

CAAACTCATC AATGTATCTT AAGGCGTAAA TTGTAAGCGT TAATATTTTG TTAAAATTCG 

2460 

CGTTAAATTT TTGTTAAATC AGCTCATTTT TTAACCAATA GGCCGAAATC GGCAAAATCC 

2520 

CTTATAAATC AAAAGAATAG ACCGAGATAG GGTTGAGTGT TGTTCGAGTT TGGAACAAGA 

2580 

GTCCACTATT AAAGAACGTG GACTCCAACG TCAAAGGGCG AAAAACCGTC TATCAGGGCG 

2640 

ATGGCCCACT ACGTGAACCA TCACCCTAAT CAAGTTTTTT GGGGTCGAGG TGCCGTAAAG 

2700 

CACTAAATCG GAACCCTAAA GGGAGCCCCC GATTTAGAGC TTGACGGGGA AAGCCGGCGA 

2760 

ACGTGGCGAG AAAGGAAGGG AAGAAAGCGA AAGGAGCGGG CGCTAGGGCG CTGGCAAGTG 

2820 

TAGCGGTCAC GCTGCGCGTA ACCACCACAC CCGCCGCGCT TAATGCGCCG CTACAGGGCG 

2880 

CGTCAGGTGG CACTTTTCGG GGAAATGTGC GCGGAACCCC TATTTGTTTA TTTTTCTAAA 

2940 

TACATTCAAA TATGTATCCG CTCATGAGAC AATAACCCTG ATAAATGCTT CAATAATATT 
3000 

GAAAAAGGAA GAGTCCTGAG GCGGAAAGAA CCAGCTGTGG AATGTGTGTC AGTTAGGGTG 

3060 

TGGAAAGTCC CCAGGCTCCC CAGCAGGCAG AAGTATGCAA AGCATGCATC TCAATTAGTC 

3120 

AGCAACCAGG TGTGGAAAGT CCCCAGGCTC CCCAGCAGGC AGAAGTATGC AAAGCATGCA 

3180 

TCTCAATTAG TCAGCAACCA TAGTCCCGCC CCTAACTCCG CCCATCCCGC CCCTAACTCC 

3240 

GCCCAGTTCC GCCCATTCTC CGCCCCATGG CTGACTAATT TTTTTTATTT ATGCAGAGGC 

3300 

CGAGGCCGCC TCGGCCTCTG AGCTATTCCA GAAGTAGTGA GGAGGCTTTT TTGGAGGCCT 
3360 

AGGCTTTTGC AAAGATCGAT CAAGAGACAG GATGAGGATC GTTTCGCATG ATTGAACAAG 

3420 

ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG 

3480 

CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC 
3540 

CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAA GACGAGGCAG 

3600 

CGCGGCTATC GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA 
3660 

CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT 
3720 

CTCATCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA 
3780 

CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC 

3840 

GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC 
3900 

TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGAGCAT GCCCGACGGC GAGGATCTCG 
3960 

TCGTGACCCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG 
4020 

GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACGGCTA TCAGGACATA GCGTTGGCTA 

4080 

CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG 

4140 

GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT 

4200 

GAGCGGGACT CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA 
4260 

TTTCGATTCC ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT TCCGGGACGC 
4320 

CGGCTGGATG ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC ACCCTAGGGG 

4380 

GAGGCTAACT GAAACACGGA AGGAGACAAT ACCGGAAGGA ACCCGCGCTA TGACGGCAAT 
4440 

AAAAAGACAG AATAAAACGC ACGGTGTTGG GTCGTTTGTT CATAAACGCG GGGTTCGGTC 

4500 

CCAGGGCTGG CACTCTGTCG ATACCCCACC GAGACCCCAT TGGGGCCAAT ACGCCCGCGT 

4560 

TTCTTCCTTT TCCCCACCCC ACCCCCCAAG TTCGGGTGAA GGCCCAGGGC TCGCAGCCAA 

4620 

CGTCGGGGCG GCAGGCCCTG CCATAGCCTC AGGTTACTCA TATATACTTT AGATTGATTT 

4680 

AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC 

4740 

CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA 

4800 

AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 

4860 

ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT 

4920 

AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG 

4980 

CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC 

5040 

AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 

5100 

ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA 

5160 

GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT 

5220 

TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 

5280 

CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA 

5340 

CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA 

5400 

CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT 

5460 

CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCATGC ATTAGTTATT 

5520 

AATAGTAATC AATTACGGGG TCATTAGTTC ATAGCCCATA TATGGAGTTC CGCGTTACAT 

5580 

AACTTACGGT AAATGGCCCG CCTGGCTGAC CGCCCAACGA CCCCCGCCCA TTGACGTCAA 

5640 

TAATGACGTA TGTTCCCATA GTAACGCCAA TAGGGACTTT CCATTGACGT CAATGGGTGG 

5700 

AGTATTTACG GTAAACTGCC CACTTGGCAG TACATCAAGT GTATCATATG CCAAGTACGC 

5760 

CCCCTATTGA CGTCAATGAC GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT 

5820 

TATGGGACTT TCCTACTTGG CAGTACATCT ACGTATTAGT CATCGCTATT ACCATGGTGA 
5880 

TGCGGTTTTG GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG GGATTTCCAA 

5940 

GTCTCCACCC CATTGACGTC AATGGGAGTT TGTTTTGGCA CCAAAATCAA CGGGACTTTC 

6000 

CAAAATGTCG TAACAACTCC GCCCCATTGA CGCAAATGGG CGGTAGGCGT GTACGGTGGG 

6060 

AGGTCTATAT AAGCAGAGCT GGTTTAGTGA ACCGTCAGAT CCGCTAGCGC TACCGGACTC 

6120 

A 

6121 